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NOVEL COMPOUNDS 

Field of the Invention : 

This invention relates to newly identified polynucleotides, polypeptides encoded by 
such polynucleotides, the uses of such polynucleotides and polypeptides, as well as the 
5 production of such polynucleotides and polypeptides and recombinant host cells 
transformed with the polynucleotides. This invention also relates to inhibiting the 
biosynthesis or action of such polynucleotides or polypeptides and to the use of such 
inhibitors in therapy. Preferred embodiments of the invention include Streptococcal 
polypeptides and polynucleotides, particularly those of Streptococcus pneumoniae. 

10 Background of the Invention: 

The Streptococci make up a medically important genera of microbes known to 
cause several types of disease in humans, including otitis media, pneumonia and meningitis. 
Since its isolation more than 100 years ago, Streptococcus pneumoniae has been one of the 
more intensively studied microbes. For example, much of our early understanding that 

1 5 DNA is, in fact, the genetic material was predicated on the work of Griffith and of Avery, 
Macleod and McCarty using this microbe. Despite the vast amount of research with S. 
pneumoniae, many questions concerning the virulence of this microbe remain. 

While certain Streptococcal factors associated with pathogenicity have been 
identified, e.g., capsule polysaccharides, peptidoglycans, pneumolysins, PspA Complement 

20 factor H binding component, autolysin, neuraminidase, peptide permeases, hydrogen 
peroxide, IgAl protease, the list is certainly not complete. Further very little is known 
concerning the temporal expression of such genes during infection and disease progression 
in a mammalian host. Discovering the sets of genes the bacterium is likely to be expressing 
at the different stages of infection, particularly when an infection is established, provides 

25 critical information for the screening and characterization of novel antibacterials which can 
interrupt pathogenesis. In addition to providing a fuller understanding of known proteins, 
such an approach will identify previously unrecognised targets. 
Brief Description of the Invention: 

This invention provides novel proteins, particularly those from Streptococcus 

30 pneumoniae, strain 0100993, characterised in that it comprises the amino acid sequences 
given herein or fragments, analogues or derivatives thereof. 

In accordance with another aspect of the present invention, there are provided 
polynucleotides (DNA or RNA) which encode such polypeptides. 
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In particular the invention provides polynucleotides having the DNA sequences 
given herein. 

The invention also relates to novel oligonucleotides derived from the sequences 
given herein which can act as PCR primers in the process herein described to determine 
whether or not the Streptococcus pneumoniae gene identified herein in whole or in part is 
expressed in infected tissue. It is recognised that such sequences will also have utility in 
diagnosis of the stage of infection and type of infection the pathogen has attained. The 
proteins so identified are also useful as targets in screens designed to identify antimicrobial 
compounds. 

It is an object of the invention to provide polypeptides that have been identified as novel 
polypeptides by homology between the amino acid sequence set out in the Sequence Listing and 
a known amino acid sequence or sequences of other proteins such as the proteins identified under 
the heading Identity in Table 1. 

It is a further object of the invention to provide polynucleotides that encode polypeptides 
15 set forth in the Sequence Listing, particularly polynucleotides that encode the polypeptide set 
forth in the Sequence Listing. 

In a particularly preferred embodiment of the invention the polynucleotide comprises a 
region encoding polypeptides comprising the sequence set out in the Sequence Listing, or a 
variant thereof. 

20 In another particularly preferred embodiment of the invention there is a novel 

protein from comprising an amino acid sequence of the Sequence Listing, or a variant 
thereof. 

In accordance with another aspect of the invention, there is provided the use of a 
polynucleotide of the invention for therapeutic or prophylactic purposes, in particular 
25 genetic immunization. Among the particularly preferred embodiments of the invention are 
naturally occurring allelic variants the polynucleotides set forth in the Sequence Listing and 
polypeptides encoded-thereby. 

In accordance with yet another aspect of the invention, there are provided inhibitors 
to such polypeptides, useful as antibacterial agents, including, for example, antibodies. 

In accordance with certain preferred embodiments of the invention, there are provided 
products, compositions and methods for assessing expression of the sequences the Sequence 
Listing, treating disease, for example, otitis media, conjunctivitis, pneumonia, bacteremia, 
meningitis, sinusitis, pleural empyema and endocarditis, and most particularly meningitis, 
such as for example infection of cerebrospinal fluid, assaying genetic variation, and 
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administering a polypeptide or polynucleotide of the invention to an organism to raise an 
immunological response against a bacteria, especially a Streptococcus pneumoniae bacteria. 

In accordance with certain preferred embodiments of this and other aspects of the 
invention there are provided polynucleotides that hybridize to polynucleotide sequences of the 
invention, particularly under stringent conditions. 

In certain preferred embodiments of the invention there are provided antibodies against 
polypeptides of the invention. 

In other embodiments of the invention there are provided methods for identifying 
compounds which bind to or otherwise interact with and inhibit or activate an activity of a 
polypeptide or polynucleotide of the invention comprising: contacting a polypeptide or 
polynucleotide of the invention with a compound to be screened under conditions to permit 
binding to or other interaction between the compound and the polypeptide or polynucleotide to 
assess the binding to or other interaction with the compound, such binding or interaction being 
associated with a second component capable of providing a detectable signal in response to the 
binding or interaction of the polypeptide or polynucleotide with the compound; and fctermining 
whether the compound binds to or otherwise interacts with and activates or inhibits an activity of 
the polypetide or polynucleotide by detecting the presence or absence of a signal generated from 
the binding or interaction of the compound with the polypeptide or polynucleotide. 

In accordance with yet another aspect of the invention, there are provided agonists and 
antagonists of the polynucleotide and/or polypeptides of the invention, preferably bacteriostatic 
or bacteriocidal agonists and antagonists. 

In a further aspect of the invention there are provided compositions comprising a 
polynucleotide or a polypeptide of the invention for administration to a cell or to a multicellular 
organism. 

Various changes and modifications within the spirit and scope of the disclosed invention 
will become readily apparent to those skilled in the art from reading the following descriptions 
and from reading the other parts of the present disclosure. 
Detailed Description of the Invention; 

Each of the DNA sequences provided herein may be used in the discovery and 
development of antibacterial compounds. Because each of the sequences contains an open 
reading frame (ORF) with appropriate initiation and termination codons, the encoded 
protein upon expression can be used as a target for the screening of antimicrobial drugs. 
Additionally, the DNA sequences encoding preferably the amino terminal regions of the 
encoded protein or the Shine-Delgarno region can be used to construct antisense sequences 
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to control the expression of the coding sequence of interest. Furthermore, many of the 
sequences disclosed herein also provide regions upstream and downstream from the 
encoding sequence. These sequences are useful as a source of regulatory elements for the 
control of bacterial gene expression. Such sequences are conveniently isolated by 
5 restriction enzyme action or synthesized chemically and introduced, for example, into 
promoter identification strains. These strains contain a reporter structural gene sequence 
located downstream from a restriction site such that if an active promoter is inserted, the 
reporter gene will be expressed. 

Although each of the sequences may be employed as described above, this 
10 invention also provides several means for identifying particularly useful target genes. The 
first of these approaches entails searching appropriate databases for sequence matches in 
related organisms. Thus, if a homologue exists, the Streptococcal-like form of this gene 
would likely play an analogous role. For example, a Streptococcal protein identified as 
homologous to a cell surface protein in another organism would be useful as a vaccine 
15 candidate. To the extent such homologies have been identified for the sequences disclosed 
herein they are reported along with the encoding sequence. 

Recently techniques have become available to evaluate temporal gene expression in 
bacteria, particularly as it applies to viability under laboratory and infection conditions. A 
number of methods can be used to identify genes which are essential to survival per se, or 
20 essential to the establishment/maintenance of an infection. Identification of an ORF 
unknown by one of these methods yields additional information about its function and 
permits the selection of such an ORF for further development as a screening target. Briefly, 
these approaches include: 
1) Signature Tagged Mutagenesis (STM) 
25 This technique is described by Hensel el al.. Science 269: 400-403(1995). the 

contents of which is incorporated by reference for background purposes. 
Signature tagged mutagenesis identifies genes necessary for the establishment/maintenance 
of infection in a given infection model. 

The basis of the technique is the random mutagenesis of target organism by various 
30 means (e.g.. transposons) such that unique DNA sequence tags are inserted in close 

proximity to the site of mutation. The tags from a mixed population of bacterial mutants 
and bacteria recovered from an infected hosts are detected by amplification, radiolabeling 
and hybridisation analysis. Mutants attenuated in virulence are revealed by absence of the 
tag from the pool of bacteria recovered from infected hosts. 
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In Streptococcus pneumoniae because the transposon system is less well 
developed, a more efficient way of creating the tagged mutants is to use the insertion- 
duplication mutagenesis technique as described by Morrison et al. B L Bacteriol. 159:870 
(1984) the contents of which is incorporated by reference for background purposes. 
2) In Vivo Expression Technology (TVET) 

This technique is described by Camilli & al, Eg£. iiatl Acad. Sfil USA . 91 :2634- 
2638 (1994), the contents of which is incorporated by reference for background purposes. 
IVET identifies genes up-regulated during infection when compared to laboratory 
cultivation, implying an important role in infection. ORF identified by this technique are 
implied to have a significant role in infection establishment/maintenance. 

In this technique random chromosomal fragments of target organism are cloned 
upstream of a promoter-less recombinase gene in a plasmid vector. This construct is 
introduced into the target organism which carries an antibiotic resistance gene flanked by 
resolvase sites. Growth in the presence of the antibiotic removes from the population those 
fragments cloned into the plasmid vector capable of supporting transcription of the 
recombinase gene and therefore have caused loss of antibiotic resistance. The resistant pool 
is introduced into a host and at various times after infection bacteria may be recovered and 
assessed for the presence of antibiotic resistance. The chromosomal fragment carried by 
each antibiotic sensitive bacterium should carry a promoter or portion of a gene normally 
upregulated during infection. Sequencing upstream of the recombinase gene allows 
identification of the up regulated gene. 
3) Differential display 

This technique is described by Chuangeiai., J. Bacteriol 175:2026-2036(1993), 
the contents of which is incorporated by reference for background purposes. This method 
identifies those genes which are expressed in an organism by identifying mRNA present 
using randomly-primed RT-PCR. By comparing pre-infection and post infection profiles, 
genes up and down regulated during infection can be identified and the RT-PCR product 
sequenced and matched to ORF 'unknowns*. 
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4) Generation of conditional lethal mutants by transposon mutagenesis. 

This technique, described by de Lorenzo, V. ej a]., Qsds. 123: 17-24 (1993); 

Neuwald. A. F. a]., fiens 125: 69-73(1993); and Takiff, H. E. a]., L£a£tClioJ. 

174:1544-1553(1992), the contents of which is incoiporated by reference for 

background purposes, identifies genes whose expression are essential for cell viability. 
In this technique transposons carrying controllable promoters, which provide 

transcription outward from the transposon in one or both directions, are generated. Random 
insertion of these transposons into target organisms and subsequent isolation of insertion 
mutants in the presence of inducer of promoter activity ensures that insertions which 
separate promoter from coding region of a gene whose expression is essential for cell 
viability will be recovered. Subsequent replica plating in the absence of inducer identifies 
such insertions, since they fail to survive. Sequencing of the flanking regions of the 
transposon allows identification of site of insertion and identification of the gene disrupted. 
Close monitoring of the changes in cellular processes/morphology during growth in the 
absence of inducer yields information on likely function of the gene. Such monitoring 
could include flow cytometry (cell division, lysis, redox potential. DNA replication), 
incorporation of radiochemical^ labeled precursors into DNA, RNA. protein, lipid, 
peptidoglycan. monitoring reporter enzyme gene fusions which respond to known cellular 
stresses. 

5) Generation of conditional lethal mutants by chemical mutagenesis. 

This technique is described by Beckwith. J.. Method in Fn, yrfn | nr , 2 04: 
3-18(1991). the contents of which are incorporated herein by reference for background 
purposes. In this technique random chemical mutagenesis of target organism, growth at 
temperature other than physiological temperature (permissive temperature) and subsequent 
replica plating and growth at different temperature (e.g. 42°C to identify ts. 25°C to identify 
cs) are used to identify those isolates which now fail to grow (conditional mutants). As 
above close monftonng of the changes upon growth at the non-permissive temperature 
yields information on the function of the mutated gene. Complementation of conditional 
lethal mutation by library from target organism and sequencing of complementing gene 
allows matching with unknown ORE 

6) RT-PCR 

Streptococcus pneumoniae messenger RNA is isolated from bacterial infected tissue e.g. 48 
hour murine lung infections, and the amount of each mRNA species assessed by reverse 
transcription of the RNA sample primed with random hexanucleotides followed by PCR 
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with gene specific primer pairs. The determination of the presence and amount of a 
particular mRNA species by quantification of the resultant PCR product provides 
information on the bacterial genes which are transcribed in the infected tissue. Analysis of 
gene transcription can be carried out at different times of infection to gain a detailed 
S knowledge of gene regulation in bacterial pathogenesis allowing for a clearer understanding 
of which gene products represent targets for screens for novel antibacterials. Because of 
the gene specific nature of the PCR primers employed it should be understood that the 
bacterial mRNA preparation need not be free of mammalian RNA. This allows the 
investigator to carry out a simple and quick RNA preparation from infected tissue to 

10 obtain bacterial mRNA species which are very short lived in the bacterium (in the order of 2 
minute halflives). Optimally the bacterial mRNA is prepared from infected murine lung 
tissue by mechanical disruption in the presence of TRIzole (GIBCO-BRL) for very short 
periods of time, subsequent processing according to the manufacturers of TRIzole reagent 
and DNAase treatment to remove contaminating DNA. Preferably the process is optimised 

15 by finding those conditions which give a maximum amount of S.pneumoniae 16S 

ribosomal RNA as detected by probing Northerns with a suitably labelled sequence specific 
oligonucleotide probe. Typically a 5' dye labelled primer is used in each PCR primer pair 
in a PCR reaction which is terminated optimally between 8 and 25 cycles. The PCR 
products are separated on 6% polyacrylamide gels with detection and quantification using 

20 GeneScanner (manufactured by ABI). 

Each of these techniques may have advantages or disadvantage depending on the 
particular application. The skilled artisan would choose the approach that is the most 
relevant with the particular end use in mind. For example, some genes might be recognised 
as essential for infection but in reality are only necessary for the initiation of infection and 

25 so their products would represent relatively unattractive targets for antibacterials developed 
to cure established and chronic infections. 

Use of the of these technologies when applied to the ORFs of the present invention 
enables identification of bacterial proteins expressed during infection, inhibitors of which 
would have utility in anti-bacterial therapy. 

30 Streptococcus pneumoniae, strain 0100993 has been deposited at the National 

Collection of Industrial and Marine Bacteria Ltd. (NCIMB), Aberdeen, Scotland under 
NCIMB number 40794 on 1 1 April 1996, and a Streptococcus pneumoniae, strain 0100993 
DNA library in £. coli was similarly deposited on 17 April 1996 under NCIMB number 
40800. 
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The nucleotide sequences disclosed herein can be obtained by synthetic chemical 
techniques known in the art or can be obtained from Streptococcus pneumoniae, strain 
0100993by probing a DNA preparation with probes constructed from the particular 
sequences disclosed herein. Alternatively, oligonucleotides derived from a disclosed 
sequence can act as PGR primers in a process of PCR-based cloning of the sequence from a 
bacterial genomic source. It is recognised that such sequences will a | so have utility in 
diagnosis of the stage of infection and type of infection the pathogen has attained. 

To obtain the polynucleotide encoding the protein using the DNA sequence given 
herein typically a library of clones of chromosomal DNA of Streptococcus pneumoniae, 
strain 01 00993 in E. coli or some other suitable host is probed with a radiolabeled 
oligonucleotide, preferably a 17mer or longer, derived from the partial sequence. Clones 
carrying DNA identical to that of the probe can then be distinguished using high stringency 
washes. By sequencing the individual clones thus identified with sequencing primers 
designed from the original sequence it is then possible to extend the sequence in both 
directions to determine the full gene sequence. Conveniently such sequencing is performed 
using denatured double stranded DNA prepared from a plasmid clone. Suitable techniques 
are described by Maniatis. T.. Fritsch. E.F. and Sambrook. J. in MOLECULAR CLONING 
A Laboratory Manual. 2nd edition. 1989. Cold Spring Harbor Laboratory (see: Screening ' 
By Hybridization 1.90 and Sequencing Denatured Double-Stranded DNA Templates 
20 13.70). 

A polynucleotide of the present invention may be in the form of RNA or in the 
form of DNA. which DNA includes cDNA, genomic DNA. and synthetic DNA. The DNA 
may be double-stranded or single-stranded, and if single stranded may be the coding strand 
or non-cod.ng (anti-sense) strand. The coding sequence which encodes the polypeptide 
may be identical to the coding sequence shown or may be a different coding sequence 
which coding sequence, as a result of the redundancy or degeneracy of the genetic code, 
encoding the same polypeptide. 

The present invention includes variants of the hereinabove described 
polynucleotides which encode fragments, analogues and derivatives of the polypeptide 
characterized by the deduced amino acid sequence given herein. The variant of the 
polynucleotide may be a naturally occurring allelic variant of the polynucleotide or a non- 
naturally occurring variant of the polynucleotide. In addition to the standard A. G. C. T/U 
representations for nucleic acid bases, the term "N" is also used. «N" means that any of the 
four DNA or RNA bases may appear at such a designated position in the DNA or RNA 
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sequence, except that N cannot be a base that when taken in combination with adjacent 
nucleotide positions, when read in the correct reading frame, would have the effect of 
generating a premature termination codon in such reading frame. 

Thus, the present invention includes polynucleotides encoding the same 
polypeptide characterized by the deduced amino acid sequence given herein as well as 
variants of such polynucleotides which variants encode for a fragment, derivative or 
analogue of the polypeptide. Such nucleotide variants include deletion variants, 
substitution variants and addition or insertion variants. 

The polynucleotide may have a coding sequence which is a naturally occurring 
allelic variant of the coding sequence characterized by the DNA sequence disclosed herein. 
As known in the art, an allelic variant is an alternate form of a polynucleotide sequence 
which may have a substitution, deletion or addition of one or more nucleotides, which does 
not substantially alter the function of the encoded polypeptide. 

The polynucleotide which encodes for the mature polypeptide, may include only 
the coding sequence for the mature polypeptide or the coding sequence for the mature 
polypeptide and additional coding sequence such as a leader or secretory sequence or a 
proprotein sequence. 

Thus, the term "polynucleotide encoding a polypeptide" encompasses a 
polynucleotide which includes only coding sequence for the polypeptide as well as a 
polynucleotide which includes additional coding and/or non-coding sequence. 

The present invention therefore includes polynucleotides, wherein the coding 
sequence for the mature polypeptide may be fused in the same reading frame to a 
polynucleotide sequence which aids in expression and secretion of a polypeptide from a 
host cell, for example, a leader sequence which functions as a secretory sequence for 
controlling transport of a polypeptide from the ceil. The polypeptide having a leader 
sequence is a preprotein and may have the leader sequence cleaved by the host cell to form 
the mature form of the polypeptide. The polynucleotides may also encode for a proprotein 
which is the mature protein plus additional 5' amino acid residues. A mature protein having 
a prosequence is a proprotein and may be an inactive form of the protein. Once the 
prosequence is cleaved an active mature protein remains. 

■Rius, for example, the polynucleotide of the present invention may code for a 
mature protein, or for a protein having a prosequence or for a protein having both a 
prosequence and a presequence (leader sequence). Further, the amino acid sequences 
provided herein show a methionine residue at the NH r terminus. It is appreciated, however. 
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that during post-translational modification of the peptide, this residue may be deleted. 
Accordingly, this invention contemplates the use of both the methionine-containing and the 
methionineless amino terminal variants of each protein disclosed herein. 

The polynucleotides of the present invention may also have the coding sequence 
fused in frame to a marker sequence at either the 5* or 3' terminus of the gene which allows 
for purification of the polypeptide of the present invention. The marker sequence may be a 
hexa-histidine tag supplied by the pQE series of vectors (supplied commercially by Quiagen 
Inc.) to provide for purification of the polypeptide fused to the marker in the case of a 
bacterial host. 

The present invention further relates to polynucleotides which hybridize to the 
hereinabove-described sequences if there is at least 50% and preferably at least 70% identity 
between the sequences. The present invention particularly relates to polynucleotides, 
particularly Streptococcal polynucleoudes, which hybridize under stringent condition to 
the hereinabove-described polynucleotides . As herein used, the term "stringent conditions- 
means hybridization will occur only if there is at least 95% and preferably at least 97% 
identity between the sequences. The polynucleotides which hybridize to the hereinabove 
described polynucleotides in a preferred embodiment encode polypeptides which retain 
substantially the same biological function or activity as the polypeptide characterised by the 
deduced amino acid sequence given herein. An example of stringent hybridization 
conditions is overnight incubation at 42°C in a solution comprising: 50% formamide. 5x 
SSC (l50mM NaCl, 15mM trisodium citrate). 50 mM sodium phosphate (pH7.6), 5x 
Denhardfs solution. 10% dextran sulfate, and 20 micrograms/ml denatured, sheared salmon 
sperm DNA, followed by washing the hybridization support in O.lx SSC at about 65°C. 
Hybridization and wash conditions are well known and exemplified in Sambrook. et al.. 
Molecular Cloning: A Laboratory Manual, Second Edition. Cold Spring Harbor, N.Y.. 
(1989), particularly Chapter 1 1 therein. 

The invention also provides a polynucleotide consisting essentially of a 
polynucleotide sequence obtainable by screening an appropriate library containing the 
complete gene for a polynucleotide sequence selected from the group consisting of the 
polynucleotides of the Sequence Listing under stringent hybridization conditions with a 
probe having the sequence of said polynucleotide sequence set forth in said polynucleotide 
of the Sequence Listing or a fragment thereof; and isolating said DNA sequence. 
Fragments useful for obtaining such a polynucleotide include, for example, probes and 
primers described elsewhere herein. 
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"Identity/' as known in the art and used herein, is a relationship between two or more 
polypeptide sequences or two or more polynucleotide sequences, as determined by comparing 
the sequences. In the art, "identity" also means the degree of sequence relatedness between 
polypeptide or polynucleotide sequences, as the case may be, as determined by the match 
5 between strings of such sequences. "Identity" and "similarity" can be readily calculated by 
known methods, including but not limited to those described in (Computational Molecular 
Biology. Lesk, A.M., ed., Oxford University Press, New York, 1988; Biocomputing: 
Informatics and Genome Projects* Smith, D.W., ed., Academic Press, New York, 1993; 
Computer Analysis of Sequence Data, Part I, Griffin, A.M., and Griffin, H.G., eds., 

10 Humana Press, New Jersey, 1994; Sequence Analysis in Molecular Biology, von Heinje, G., 
Academic Press, 1987; and Sequence Analysis Primer, Gribskov, M. and Devereux, J., eds., 
M Stockton Press, New York, 1991 ; and Carillo, H„ and Lipman, D., SLAM / Applied 
Math. t 48: 1073 (1988). Preferred methods to determine identity are designed to give the 
largest match between the sequences tested. Methods to determine identity and similarity 

15 are codified in publicly available computer programs. Preferred computer program methods 
to determine identity and similarity between two sequences include, but are not limited to, 
the GCG program package (Devereux, J., et al.. Nucleic Acids Research 12(1): 387 (1984)), 
BLASTP, BLASTN, and FASTA (Atschul, S.F. et al., / Molec. Biol. 215: 403-410 (1990). 
The BLAST X program is publicly available from NCBI and other sources (BLAST 

20 Manual Altschul, S., et al, NCBI NLM NIH Bethesda, MD 20894; Altschul, S., et ai, J. 
Mol Biol 215: 403-410 (1990). 

As an illustration, by a polynucleotide having a nucleotide sequence having at least, 
for example, 95% "identity" to a reference nucleotide sequence selected from the group 
consisting of the polynucleotide of the Sequence Listing is intended that the nucleotide 

25 sequence of the polynucleotide is identical to the reference sequence except that the 

polynucleotide sequence may include up to five point mutations per each 100 nucleotides of 
the reference nucleotide sequence. In other words, to obtain a polynucleotide having a 
nucleotide sequence at least 95% identical to a reference nucleotide sequence, up to 5% of 
the nucleotides in the reference sequence may be deleted or substituted with another 

30 nucleotide, or a number of nucleotides up to 5% of the total nucleotides in the reference 
sequence may be inserted into the reference sequence. These mutations of the reference 
sequence may occur at the 5 or 3 terminal positions of the reference nucleotide sequence 
or anywhere between those terminal positions, interspersed either individually among 
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nucleotides in the reference sequence or in one or more contiguous groups within the 
reference sequence. 

Similarly, by a polypeptide having an amino acid sequence having at least, 
for example. 95% identity to a reference amino acid sequence of selected from the 
group consisting of the amino acids of the Sequence Listing is intended that the 
amino acid sequence of the polypeptide is identical to the reference sequence except 
that the polypeptide sequence may include up to Five amino acid alterations per 
each 100 amino acids of the reference amino acid. In other words, to obtain a 
polypeptide having an amino acid sequence at least 95% identical to a reference 
amino acid sequence, up to 5% of the amino acid residues in the reference sequence 
may be deleted or substituted with another amino acid, or a number of amino acids 
up to 5% of the total amino acid residues in the reference sequence may be inserted 
into the reference sequence. These alterations of the reference sequence may occur 
at the amino or carboxy terminal positions of the reference amino acid sequence or 
anywhere between those terminal positions, interspersed either individually among 
residues in the reference sequence or in one or more contiguous groups within the 
reference sequence. 

The deposit referred to herein will be maintained under the terms of the Budapest 
Treaty on the International Recognition of the Deposit of Micro-organisms for purposes of 
Patent Procedure. These deposits are provided merely as convenience to those of skill in 
the art and are not an admission that a deposit is required under 35 U.S.C. §112. The 
sequence of the polynucleotides contained in the deposited material, as well as the amino 
acid sequence of the polypeptides encoded thereby, are incorporated herein by reference and 
are controlling in the event of any conflict with any description of sequences herein. A 
license may be required to make, use or sell the deposited material, and no such license is 
hereby granted. 

The terms fragment,- "derivative" and "analogue" when referring to the 
polypeptide characterized by the deduced amino acid sequence herein, means a polypeptide 
which retains ondill, &e same biological function or activity as such polypeptide. 
Thus, an analogue includes a proprotei'n which can be activated by cleavage of the 
proprotein portion to produce an active mature polypeptide. 

The polypeptide of the present invention may be a recombinant polypeptide, a 
natural polypeptide or a synthetic polypeptide, preferably a recombinant polypeptide. 
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The fragment, derivative or analogue of the polypeptide characterized by the 
deduced amino acid sequence herein may be (i) one in which one or more of the amino acid 
residues are substituted with a conserved or non-conserved amino acid residue (preferably a 
conserved amino acid residue) and such substituted amino acid residue may or may not be 
one encoded by the genetic code, or (ii) one in which one or more of the amino acid 
residues includes a substituent group, or (iii) one in which the polypeptide is fused with 
another compound, such as a compound to increase the half-life of the polypeptide (for 
example, polyethylene glycol), or (iv) one in which the additional amino acids are fused to 
the polypeptide, such as a leader or secretory sequence or a sequence which is employed for 
purification of the polypeptide or a proprotein sequence. Such fragments, derivatives and 
analogues are deemed to be within the scope of those skilled in the art from the teachings 
herein. 

The polypeptides and polynucleotides of the present invention are preferably 
provided in an isolated form, and preferably are purified to homogeneity. 

The term "isolated" means that the material is removed from its original 
environment (e.g., the natural environment if it is naturally occurring). For example, a 
naturally-occurring polynucleotide or polypeptide present in a living animal is not isolated, 
but the same polynucleotide or polypeptide, separated from some or all of the coexisting 
materials in the natural system, is isolated. Such polynucleotides could be part of a vector 
and/or such polynucleotides or polypeptides could be part of a composition, and still be 
isolated in that such vector or composition is not part of its natural environment. 

The present invention also relates to vectors which include polynucleotides of the 
present invention, host cells which are genetically engineered with vectors of the invention 
and the production of polypeptides of the invention by recombinant techniques. 

In accordance with yet a further aspect of the present invention, there is therefore 
provided a process for producing the polypeptide of the invention by recombinant 
techniques by expressing a polynucleotide encoding said polypeptide in a host and 
recovering the expressed product. Alternatively, the polypeptides of the invention can be 
synthetically produced by conventional peptide synthesizers. 

Host cells are genetically engineered (transduced or transformed or transfected) 
with the vectors of this invention which may be, for example, a cloning vector or an 
expression vector. The vector may be, for example, in the form of a plasmid, a cosmid, a 
phage, etc. The engineered host cells can be cultured in conventional nutrient media 
modified as appropriate for activating promoters, selecting transformants or amplifying the 
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genes. The culture conditions, such as temperature, pH and the like, are those previously 
used with the host cell selected for expression, and will be apparent to the ordinarily skilled 
artisan. 

Suitable expression vectors include chromosomal, nonchromosomal and synthetic 
DNA sequences, e.g., bacterial plasmids; phage DNA; baculovirus; yeast plasmids; vectors 
derived from combinations of plasmids and phage DNA. However, any other vector may 
be used as long as it is replicable and viable in the host. 

The appropriate DNA sequence may be inserted into the vector by a variety of 
procedures. In general, the DNA sequence is inserted into an appropriate restriction 
endonuclease site(s) by procedures known in the art. 

The DNA sequence in the expression vector is operatively linked to an appropriate 
expression control sequence(s) (promoter) to direct mRNA synthesis. As representative 
examples of such promoters, there may be mentioned: LTR or SV40 promoter, the £ coli. 
lac or trp, the phage lambda P L promoter and other promoters known to control expression 
of genes in eukaryotic or prokaryotic ceUs or their viruses. The expression vector also 
contains a ribosome binding site for translation initiation and a transcription terminator. The 
vector may also include appropriate sequences for amplifying expression. 

In addition, the expression vectors preferably contain one or more selectable marker 
genes to provide a phenotypic trait for selection of transformed host cells such as 
dihydrofolate reductase or neomycin resistance for eukaryotic cell culture, or such as 
tetracycline or ampicillin resistance in E. coli. 

The gene can be placed under the control of a promoter, ribosome binding site (for 
bacterial expression) and. optionally, an operator (collectively referred to herein as "control" 
elements), so that the DNA sequence encoding the desired protein is transcribed into RNA 
in the host cell transformed by a vector containing this expression construction. The coding 
sequence may or may not contain a signal peptide or leader sequence. The polypeptides of 
the present invention can be expressed using, for example, the E. coli tac promoter or the 
protein A gene (spa) promoter and signal sequence. Leader sequences can be removed by 
the bacterial host in post-translationaJ processing. See. e.g., U.S. Patent Nos. 4.431.739; 
4.425.437; 4,338,397. Promoter regions can be selected from any desired gene using CAT 
(chloramphenicol transferase) vectors or other vectors with selectable markers. Two 
appropriate vectors are PKK232-8 and PCM7. Particular named bacterial promoters 
include lad. lacZ. T3. T7. gpt, lambda P R . Pl and trp. Eukaryotic promoters include CMV 
immediate early. HSV thymidine kinase, early and late SV40. LTRs from retrovirus, and 
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mouse metallothionein-I. Selection of the appropriate vector and promoter is well within the 
level of ordinary skill in the ait 

In addition to control sequences, it may be desirable to add regulatory sequences 
which allow for regulation of the expression of the protein sequences relative to the growth 
of the host cell. Regulatory sequences are known to those of skill in the art, and examples 
include those which cause the expression of a gene to be turned on or off in response to a 
chemical or physical stimulus, including the presence of a regulatory compound. Other 
types of regulatory elements may also be present in the vector, for example, enhancer 
sequences. 

An expression vector is constructed so that the particular coding sequence is located 
in the vector with the appropriate regulatory sequences, the positioning and orientation of 
the coding sequence with respect to the control sequences being such that the coding 
sequence is transcribed under the "control" of the control sequences (i.e., RNA polymerase 
which binds to the DNA molecule at the control sequences transcribes the coding 
sequence). Modification of the coding sequences may be desirable to achieve this end. For 
example, in some cases it may be necessary to modify the sequence so that it may be 
attached to the control sequences with the appropriate orientation; i.e., to maintain the 
reading frame. The control sequences and other regulatory sequences may be ligated to the 
coding sequence prior to insertion into a vector, such as the cloning vectors described 
above. Alternatively, the coding sequence can be cloned directly into an expression vector 
which already contains the control sequences and an appropriate restriction site. 

Generally, recombinant expression vectors will include origins of replication and 
selectable markers permitting transformation of the host cell, e.g., the ampicillin resistance 
gene of E. coli and 5. cerevisiae TRP1 gene, and a promoter derived from a highly- 
expressed gene to direct transcription of a downstream structural sequence. The 
heterologous structural sequence is assembled in appropriate phase with translation 
initiation and termiffffioh sequences, and preferably, a leader sequence capable of directing 
secretion of translated protein into the periplasmic space or extracellular medium. 
Optionally, the heterologous sequence can encode a fusion protein including an N-tcrminal 
identification peptide imparting desired characteristics, e.g., stabilization or simplified 
purification of expressed recombinant product. 

The vector containing the appropriate DNA sequence as hereinabove described, as 
well as an appropriate promoter or control sequence, may be employed to transform an 
appropriate host to permit the host to express the protein. 
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More particularly, the present invention also includes recombinant constructs 
comprising one or more of the sequences as broadly described above. The constructs 
comprise a vector, such as a plasmid or viral vector, into which a sequence of the invention 
has been inserted, in a forward or reverse orientation. In a preferred aspect of this 
embodiment, the construct further comprises regulatory sequences, including, for example, 
a promoter, operably linked to the sequence. Large numbers of suitable vectors and 
promoters are known to those of skill in the art. and are commercially available. The 
following vectors are provided by way of example. Bacterial: pET-3 vectors (Stratagene). 
PQE70. pQE60, pQE-9 (Qiagen). pbs. pDlO. phagescript. psiX174. pbluescript SK. pbsks. 
pNH8A. pNH16a. pNH18A. pNH46A (Stratagene); ptrc99a. pKK223-3, pKK233-3. 
PDR540. pRIT5 (Pharmacia). Eukaryotic: pBlueBacin (Invitrogen). pWLNEO. pSV2CAT. 
rX3G44.pXTl. P SG(Stratagene)r^V10.pBPV.pMSG.pSVL(Pharrnacia). However, 
any other plasmid or vector may be used as long as they are replicable and viable in the 
host. 

Examples of recombinant DNA vectors for cloning and host cells which they can 
transform include the bacteriophage 1 (£. coli). pBR322 (£. colt), pACYC177 (£. coli), 
PKT230 (gram-negative bacteria). pGVl 106 (gram-negative bacteria). pLAFRl (gram- 
negative bacteria). pME290 (non-£. coli gram-negative bacteria). pHVI4 (E. coli and 
Bacillus subtilis), P BD9 (Bacillus), pU61 (Streptomyces), pUC6 (Streptomyces), YIp5 
(Saccharomyces), a baculovirus insect cell system. YCpl9 (Saccharomyces). See. 
generally. "DNA Cloning": Vols. I & II. Glover et al. ed. IRL Press Oxford (1985) (1987) 
and; T. Maniatis et al. ("Molecular Cloning" Cold Spring Harbor Laboratory (1982). 

In some cases, it may be desirable to add sequences which cause the secretion of the 
polypeptide from the host organism, with subsequent cleavage of the secretory signal. 

Polypeptides can be expressed in host cells under the control of appropriate 
promoters. Cell-free translation systems can also be employed to produce such proteins 
using RNAs deriveJTrom the DNA constructs of the present invention. Appropriate 
cloning and expression vectors for use with prokaryotic and eukaryotic hosts are described 
by Sambrock. et al.. Molecular Cloning: A Laboratory Manual. Second Edition. Cold 
Spring Harbor. N.Y.. (1989). the disclosure of which is hereby incorporated by reference. 

Following transformation of a suitable host strain and growth of the host strain to 
an appropriate cell density, the selected promoter is induced by appropriate means (e.g.. 
temperature shift or chemical induction) and cells are cultured for an additional period. 
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Cells are typically harvested by centrifugation. disrupted by physical or chemical 
means, and the resulting crude extract retained for further purification. 

Microbial cells employed in expression of proteins can be disrupted by any 
convenient method, including freeze-thaw cycling, sonication, mechanical disruption, or use 
of cell lysing agents, such methods are well known to those skilled in the art. 

Depending on the expression system and host selected, the polypeptide of the 
present invention may be produced by growing host cells transformed by an expression 
vector described above under conditions whereby the polypeptide of interest is expressed. 
The polypeptide is then isolated from the host cells and purified. If the expression system 
secretes the polypeptide into growth media, the polypeptide can be purified directly from 
the media. If the polypeptide is not secreted, it is isolated from cell lysates or recovered 
from the cell membrane fraction. Where the polypeptide is localized to the cJsurface, 
whole cells or isolated membranes can be used as an assayable source of the desired gene 
product. Polypeptide expressed in bacterial hosts such as E. coli may require isolation from 
inclusion bodies and refolding. Where the mature protein has a very hydrophobic region 
which leads to an insoluble product of overexpression. it may be desirable to express a 
truncated protein in which the hydrophobic region has been deleted. The selection of the 
appropriate growth conditions and recovery methods are within the skill of the art. 

The polypeptide can be recovered and purified from recombinant cell cultures by 
methods including ammonium sulphate or ethanol precipitation, acid extraction, anion or 
cation exchange chromatography, phosphocellulose chromatography, hydrophobic 
interaction chromatography, affinity chromatography, hydroxylapatite chromatography and 
lectin chromatography. Protein refolding steps can be used, as necessary, in completing 
configuration of the mature protein. Finally, high performance liquid chromatography 
(HPLC) can be employed for final purification steps. 

Depending upon the host employed in a recombinant production procedure, the 
polypeptides of the present invention may be glycosylated or may be non-glycosylated. 
Polypeptides of the invention may also include an initial methionine amino acid residue. 

A "replicon" is any genetic element (e.g., plasmid. chromosome, virus) that 
functions as an autonomous unit of DNA replication in vivo; i.e.. capable of replication 
under its own control. 

A "vector" is a replicon, such as a plasmid. phage, orcosmid. to which another 
DNA segment may be attached so as to bring about the replication of the attached segment. 
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A "double-stranded DNA molecule" refers to the polymeric form of 
deoxyriboniicleotides (bases adenine, guanine, thymine, or cytosjne) in a double-stranded 
helix, both relaxed and supercoiled. This term refers only to the primary and secondary 
structure of the molecule, and does not limit it to any particular tertiary forms. Thus, this 
5 term includes double-stranded DNA found, inter alia, in linear DNA molecules (e.g.. 
restriction fragments), viruses, plasmids. and chromosomes. In discussing the structure of 
particular double-stranded DNA molecules, sequences may be described herein according to 
the normal convention of giving only the sequence in the 5' to 3' direction along the 
nontranscribed strand of DNA (i.e.. the strand having the sequence homologous to the 
10 mRNA). 

A DNA "coding sequence or or a "nucleotide sequence encoding" a particular 
protein, is a DNA sequence which is transcribed and translated into a polypeptide when 
placed under the control of appropriate regulatory sequences. 

A "promoter sequence" is a DNA regulatory region capable of binding RNA 
polymerase in a cell and initiating transcription of a downstream (3' direction) coding 
sequence. For purposes of defining the present invention, the promoter sequence is bound 
at the 3' terminus by a translation start codon (e.g.. ATG) of a coding sequence and extends 
upstream (5' direction) to include the minimum number of bases or elements necessary to 
initiate transcription at levels detectable above background. Within the promoter sequence 
will be found a transcription initiation site (conveniently defined by mapping with nuclease 
SI), as well as protein binding domains (consensus sequences) responsible for the binding 
of RNA polymerase. Eukaryotic promoters will often, but not always, contain "TATA" 
boxes and "CAT' boxes. Prokaryotic promoters contain Shine-Dalgamo sequences in 
addition to the -10 and -35 consensus sequences. 

DNA "control sequences" refers collectively to promoter sequences, ribosome 
binding sites, polyadenylation signals, transcription termination sequences, upstream 
regulatory domains, enhancers, and the like, which collectively provide for the expression 
(i.e., the transcription and translation) of a coding sequence in a host cell. 

A control sequence "directs the expression" of a coding sequence in a cell when 
RNA polymerase will bind the promoter sequence and transcribe the coding sequence into 
mRNA. which is then translated into the polypeptide encoded by the coding sequence. 

A "host cell" is a cell which has been transformed or transfected. or is capable of 
transformation or transfection by an exogenous DNA sequence. 
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A celJ has been "transformed" by exogenous DNA when such exogenous DNA has 
been introduced inside the cell membrane. Exogenous DNA may or may not be integrated 
(covalently linked) into chromosomal DNA making up the genome of the cell. In 
prokaryotes and yeasts, for example, the exogenous DNA may be maintained on an 
episomal element, such as a plasmid. With respect to eukaryotic cells, a stably transformed 
or transfected cell is one in which the exogenous DNA has become integrated into the 
chromosome so that it is inherited by daughter cells through chromosome replication. This 
stability is demonstrated by the ability of the eukaryotic cell to establish cell lines or clones 
comprised of a population of daughter cell containing the exogenous DNA. 

A "clone" is a population of cells derived from a single cell or common ancestor by 
mitosis. A "cell line" is a clone of a primary cell that is capable of stable growth in vitro for 
many generations. 

A "heterologous" region of a DNA construct is an identifiable segment of DNA 
within or attached to another DNA molecule that is not found in association with the other 
molecule in nature. 

This invention is also related to the use of the polynucleotides of the invention for use as 
diagnostic reagents. Detection of a polynucleotide or polypeptide of the invention in a 
eukaiyote, particularly a mammal, and especially a human, will provide a diagnostic method for 
diagnosis of a disease. Eukaiyotes (herein also "individual(s)"), particularly mammals, and 
especially humans, infected with an organism comprising a polynucleotide of the invention may 
be detected at the nucleic acid level by a variety of techniques. 

Nucleic acids for diagnosis may be obtained from an infected individual's cells and 
tissues, such as bone, blood, muscle, cartilage, and skin. Genomic DNA may be used directly for 
detection or may be amplified enzymatically by using PCR or other amplification technique prior 
to analysis. RNA or cDNA may also be used in the same ways. Using amplification, 
characterization of the species and strain of prokaiyote present in an individual, may be made by 
an analysis of the genotype of the prokaiyote gene. Deletions and insertions can be detected by a 
change in size of the amplified product in comparison to the genotype of a reference sequence. 
Point mutations can be identified by hybridizing amplified DNA to labeled polynucleotide 
sequences of the invention. Perfectly matched sequences can be distinguished from mismatched 
duplexes by RNase digestion or by differences in melting temperatures. DNA sequence 
differences may also be detected by alterations in the electrophoretic mobility of the DNA 
fragments in gels, with or without denaturing agents, or by direct DNA sequencing. See, e.g., 
Myers et al., Science, 230: 1242 (1985). Sequence changes at specific locations also may be 
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revealed by nuclease protection assays, such as RNase and SI protection or a chemical cleavage 

method. See. e.g.. Cotton et aJ., Proc. Natl. Acad. Sci. USA. 85: 4397^401 

(1985). 

Cells carrying mutations or polymorphisms in the gene of the invention may also be 
detected at the DNA level by a variety of techniques, to allow for serotyping. for example. For 
example. RT-PCR can be used to detect mutations. It is particularly preferred to used RT-PCR 
in conjunction with automated detection systems, such as. for example. GeneScan. RNA or 
cDNA may also be used for the same purpose. PCR or RT-PCR. As an example. PCR primers 
complementary to a nucleic acid encoding a polypeptide of the invention can be used to identify 
and analyze mutations. These primers may be used for. among othe4r things, amplifying DNA 
isolated from a sample derived from an individual. The primers may be used to amplify the gene 
isolated from an infected individual such that the gene may then be subject to various techniques 
for elucidation of the DNA sequence. In this way. mutations in the DNA sequence may be 
detected and used to diagnose infection and to serotype and/or classify the infectious agent. 

The invention further provides a process for diagnosing, disease, preferably bacterial 
infections, more preferably infections by Streptococcus pneumoniae, comprising determining 
from a sample derived from an individual a increased level of expression of polynucleotide 
having a sequence set forth in the Sequence Listing or a sequence of the invention. 
Increased or decreased expression of a polynucleotide of the invention can be measured 
using any on of the methods well known in the art for the quantation of polynucleotides, 
such as. for example, amplification. PCR. RT-PCR. RNase protection. Northern blotting 
and other hybridization methods. 

In addition, a diagnostic assay in accordance with the invention for detecting over- 
expression of a polypeptide of the invention compared to normal control tissue samples may be 
used to detect the presence of ah infection, for example. Assay techniques that can be used to 
determine levels of a polypeptide of the invention, in a sample derived from a host are well- 
known to those of slan in the art Such assay methods include radioimmunoassays, competitive- 
binding assays. Western Blot analysis and ELBA assays. 

In accordance with yet a further aspect of the present invention, there is provided 
the use of a polypeptide of the invention for therapeutic or prophylactic purposes, for 
example, as an antibacterial agent or a vaccine. 

In accordance with another aspect of the present invention, there is provided the use 
of a polynucleotide of the invention for therapeutic or prophylactic purposes, in particular 
genetic immunisation. 
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In accordance with yet another aspect of the present invention, there are provided 
inhibitors to such polypeptides, useful as antibacterial agents. In particular, there are 
provided antibodies against such polypeptides. 

Polypeptides of the invention may also be used to assess the binding of small molecule 
substrates and ligands in, for example, cells, cell-free preparations, chemical libraries, and natural 
product mixtures. These substrates and ligands may be natural substrates and ligands or may be 
structural or functional mimetics. See. e.g., Coligan et aL, Current Protocols in Immunology 
/^.Chapters (1991). 

The invention also provides a method of screening compounds to identify those which 
enhance (agonist) or block (antagonist) the action of polypeptides or polynucleotides of the 
invention, particularly those compounds that are bacteriostatic and/or bacteriocidal. The method 
of screening may involve high-throughput techniques. Forexample. to screen for agonists or 
antagoists. asyndetic reaction mix. a cellular compartment, such as a membrane, cell envelope 
or cell wall, or a preparation of any thereof, comprising a polypeptide of the invention and a 
labeled substrate or ligand of such polypeptide is incubated in the absence or the presence of a 
candidate molecule that may be an agonist or antagonist of a polypeptide of the invention. The 
ability of the candidate molecule to agonize or antagonize a polypeptide of the invention is 
reflected in decreased binding of the labeled ligand or decreased production of product from such 
substrate. Molecules that bind gratuitously. U., without inducing the effects of a polypeptide of 
the invention are most likely to be good antagonists. Molecules that bind well and increase the 
rate of product production from substrate are agonists. Detection of the rate or level of production 
of product from substrate may be enhanced by using a reporter system. Reporter systems that 
may be useful in this regard include but are not limited to colorimetric labeled substrate 
converted into product, a reporter gene that is responsive to changes in polynucleotide or 
polypeptide activity, and binding assays known in the art. 

Another example of an assay for antagonists of a polypeptide of the invention is a 
competitive assay thaf combines such polypeptide and a potential antagonist with polypeptide- 
binding molecules, recombinant polypeptide-binding molecules, natural substrates or ligands. or 
substrate or ligand mimetics. under appropriate conditions for a competitive inhibition assay. 
Polypeptides of the invention can be labeled, such as by radioactivity or a colorimetric 
compound, such that the number of such polypeptide molecules bound to a binding molecule or 
converted to product can be determined accurately to assess the effectiveness of the potential 
antagonist 
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Potential antagonists include small organic molecules, peptides, polypeptides and 
antibodies that bind to a polynucleotide or polypeptide of the invention and thereby inhibit or 
extinguish its activity. Potential antagonists also may be small organic molecules, a peptide, a 
polypeptide such as a closely related protein or antibody that binds the same sites on a binding 
molecule, without inducing an activities of a polypeptide of the invention, thereby preventing the 
action of such polypeptide by excluding it from binding. 

Potential antagonists include a small molecule that binds to and occupies the binding 
site of the polypeptide thereby preventing binding to cellular binding molecules, such that 
normal biological activity is prevented. Examples of small molecules include but are not limited 
to small organic molecules, peptides or peptide-like molecules. Other potential antagonists 
include antisense molecules (see Okano, / Neurochem. 56: 560 (1991); 
OUGODEOXYNUCLEOTIDES AS ANT I SENSE INHIBITORS OF GENE EXPRESSION. CRC 
Press. Boca Raton. FL (1988). for a description of these molecules). Preferred potential 
antagonists include compounds related to and variants of the polypetides of the invention. 

Each of the DNA sequences provided herein may be used in the discovery and 
development of antibacterial compounds. The encoded protein, upon expression, can be 
used as a target for the screening of antibacterial drugs. Additionally, the DNA sequences 
encoding the amino terminal regions of the encoded protein or Shine-Delgarno or other 
translation facilitating sequences of the respective mRNA can be used to construct antisense 
sequences to control the expression of the coding sequence of interest. 

The invention also provides the use of the polypeptide, polynucleotide or inhibitor 
of the invention to interfere with the initial physical interaction between a pathogen and 
mammalian host responsible for sequelae of infection. In particular the molecules of the 
invention may be used: in the prevention of adhesion of bacteria, in particular gram positive 
bacteria, to mammalian extracellular matrix proteins on in-dwelling devices or to 
extracellular matrix proteins in wounds; to block protein-mediated mammalian cell invasion 
by. for example, inulating phosphorylation of mammalian tyrosine kinases (Rosenshine et 
al.. Infect. Immun. 60:2211 (1992); to block bacterial adhesion between mammalian 
extracellular matrix proteins and bacterial proteins that mediate tissue damage and; to block 
the normal progression of pathogenesis in infections initiated other than by the implantation 
of in-dwelling devices or by other surgical techniques. 

The antagonists and agonists of the invention may be employed, for instance, to inhibit 
and treat otitis media, conjunctivitis, pneumonia, bacteremia, meningitis, sinusitis, pleural 
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empyema and endocarditis, and most particularly meningitis, such as for example infection of 
cerebrospinal fluid. 

Another aspect of the invention is a pharmaceutical composition comprising the 
above polypeptide, polynucleotide or inhibitor of the invention and a pharmaceutically 
5 acceptable carrier. 

In a particular aspect the invention provides the use of an inhibitor of the invention 
as an antibacterial agent. 

The invention further relates to the manufacture of a medicament for such uses. 

The polypeptide may be used as an antigen for vaccination of a host to produce 
10 specific antibodies which have anti-bacterial action. This invention also contemplates the 
use of the DNA encoding the antigen as a component in a DNA vaccine as discussed more 
fully below. 

The polypeptides or cells expressing them can be used as an immunogen to produce 
antibodies thereto. These antibodies can be, for example, polyclonal or monoclonal 
15 antibodies. The term antibodies also includes chimeric, single chain, and humanized 

antibodies, as well as Fab fragments, or the product of an Fab expression library. Various 
procedures known in the art may be used for the production of such antibodies and 
fragments. 

Antibodies generated against the polypeptides of the present invention can be 
20 obtained by direct injection of the polypeptides into an animal or by administering the 

polypeptides to an animal, preferably a nonhuman. The antibody so obtained will then bind 
the polypeptides itself. In this manner, even a sequence encoding only a fragment of the 
polypeptides can be used to generate antibodies binding the whole native polypeptides. 
Such antibodies can then be used to isolate the polypeptide from tissue expressing that 
25 polypeptide. 

Polypeptide derivatives include antigenically or immunologically equivalent 
derivatives which form a particular aspect of this invention. 

The term 'antigenically equivalent derivative' as used herein encompasses a 
polypeptide or its equivalent which will be specifically recognised by certain antibodies 
30 which, when raised to the protein or polypeptide according to the present invention, 
interfere with the interaction between pathogen and mammalian host. 

The term 'immunologically equivalent derivative 1 as used herein encompasses a 
peptide or its equivalent which when used in a suitable formulation to raise antibodies in a 
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vertebrate, the antibodies act to interfere with the interaction between pathogen and 
mammalian host. 

In particular derivatives which are slightly longer or slightly shorter than the native 
protein or polypeptide fragment of the present invention may be used. In addition, 
polypeptides in which one or more of the amino acid residues are modified may be used. 
Such peptides may, for example, be prepared by substitution, addition, or rearrangement of 
amino acids or by chemical modification thereof. All such substitutions and modifications 
are generally well known to those skilled in the art of peptide chemistry. 

The polypeptide, such as an antigenically or immunologically equivalent derivative 
or a fusion protein thereof is used as an antigen to immunize a mouse or other animal such 
as a rat or chicken. The fusion protein may provide stability to the polypeptide. The 
antigen may be associated, for example by conjugation , with an immunogenic carrier 
protein for example bovine serum albumin (BS A) or keyhole limpet haemocyanin (KLH). 
Alternatively a multiple antigenic peptide comprising multiple copies of the protein or 
polypeptide, or an antigenically or immunologically equivalent polypeptide thereof may be 
sufficiently antigenic to improve immunogenicity so as to obviate the use of a carrier. 

For preparation of monoclonal antibodies, any technique which provides antibodies 
produced by continuous cell line cultures can be used. Examples include the hybridoma 
technique (Kohler and Milstein, JialUK, 256:495-497(1975)), the trioma technique, the 
human B-cell hybridoma technique (Kozboret al.. Immunology J^ y A ^XQU^y t he 
EBV-hybridoma technique to produce human monoclonal antibodies (Cole, et al. t 1985, in 
Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, Inc., pp. 77-96). 

Techniques described for the production of single chain antibodies (U.S. Patent 
4,946,778) can be adapted to produce single chain antibodies to immunogenic polypeptide 
products of this invention. 

Using the procedure of Kohler and Milstein (fiUBa, (1975)), antibody-containing 
cells from the immuifised mammal are fused with myeloma cells to create hybridoma cells 
secreting monoclonal antibodies. 

The hybridomas are screened to select a ceil line with high binding affinity and 
favorable cross reaction with other Streptococcal species using one or more of the original 
polypeptide and/or the fusion protein. The selected cell line is cultured to obtain the desired 
Mab. 

Hybridoma cell lines secreting the monoclonal antibody are another aspect of this 
invention. 
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Alternatively phage display technology could be utilised to select antibody genes 
with binding activities towards the polypeptide either from repertoires of PCR amplified v- 
genes of lymphocytes from humans screened for possessing anti-Fbp or from naive libraries 
(McCafferty. J. etal, NaOUS 348:552-554(1990). and Marks, J. etai. Biotechnology 
10:779-783(1992)). The affinity of these antibodies can also be improved by chain 
shuffling (Clackson, T. et al.. Nature 352:624-628(1991)). 

The antibody should be screened again for high affinity to the polypeptide and/or 
fusion protein. 

As mentioned above, a fragment of the final antibody may be prepared. 
The antibody may be either intact antibody of M r approx 150,000 or a derivative of 
it, for example a Fab fragment or a Fv fragment as described in Skerra, A and Pluckthun, 
A., SffifiQCfi 240: 1038-1040 (1988). If two antigen binding domains are present each 
domain may be directed against a different epitope - termed •bispecific* antibodies. 

The antibody of the invention may be prepared by conventional means for example 
by established monoclonal antibody technology (Kohler, G. and Milstein, C. (sum. (1975) 
or using recombinant means e.g. combinatorial libraries, for example as described in Huse, 
W.D. etal., Sfiisnsfi 246:1275-1281 (1989). 

Preferably the antibody is prepared by expression of a DNA polymer encoding said 
antibody in an appropriate expression system such as described above for the expression of 
polypeptides of the invention. The choice of vector for the expression system will be 
determined in part by the host, which may be a prokaryotic cell, such as E. colt (preferably 
strain B) or Streptomyces sp. or a eukaryotic cell, such as a mouse C127, mouse myeloma, 
human HeLa, Chinese hamster ovary, filamentous or unicellular fungi or insect cell. The 
host may also be a transgenic animal or a transgenic plant (for example, as described in 
Hiatt, A. et al., Own 340:76-78(1989). Suitable vectors include plasmids. bacteriophages, 
cosmids and recombinant viruses, derived from, for example, baculoviruses and vaccinia. 

The Fab-fragment may also be prepared from its parent monoclonal antibody by 
enzyme treatment, for example using papain to cleave the Fab portion from the Fc portion. 

Preferably the antibody or derivative thereof is modified to make it less 
immunogenic in the patient. For example, if the patient is human the antibody may most 
preferably be ■humanised'; where the complimentarity determining region(s) of the 
hybridoma-derived antibody has been transplanted into a human monoclonal antibody . for 
example as described in Jones, P. et al, MailStt.32 1:522-525 (1986), or Tempest etal.. 
Biotechnology 9:266-273 (1991). 
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The modification need not be restricted to one of 'humanisation' ; other primate 
sequences (for example Newman, R. et at.. Biotechnology 10:1455-1460 (1992)) may also 
be used. 

The humanised monoclonal antibody, or its fragment having binding activity, form 
5 a particular aspect of this invention. 

This invention provides a method of screening drugs to identify those which 
interfere with the proteins selected as targets herein, which method comprises measuring the 
interference of the activity of the protein by a test drag. For example if the protein selected 
has a catalytic activity, after suitable purification and formulation the activity of the enzyme 

10 can be followed by its ability to convert its natural substrates. By incorporating different 
chemically synthesised test compounds or natural products into such an assay of enzymatic 
activity one is able to detect those additives which compete with the natural substrate or 
otherwise inhibit enzymatic activity. 

The invention also relates to inhibitors identified thereby. 

15 The use of a polynucleotide of the invention in genetic immunisation will 

preferably employ a suitable delivery method such as direct injection of plasmid DNA into 
muscles (Wolff et a/., Hum, Moi. Cfenfil 1:363 (1992); Mamhorpe et a/., Hum. Gene Ther . 
4:419 (1963)), delivery of DNA complexed with specific protein carriers ( Wu etaL. L 
Biol, Chsm. 264:16985 (1989)), coprecipitation of DNA with calcium phosphate 

20 (Benvenisty & Reshef, Proc.Nafl Aead. Sei lift A 83:9551 (1986)). encapsulation of DNA 
in various forms of liposomes (Kaneda et a/., Science 243:375 (1989)), particle 
bombardment (Tang et aL t Matms 356:152 (1992)); Eisenbraun et aL, DNA Cell Bin! 
12:791 (1993)) and in vivo infection using cloned retroviral vectors (Seeger et al. % Proc. 
Nat'l. Acad. Sci. USA 81:5849 (1984)). Suitable promoters for muscle transfection include 

25 CMV, RSV, SRa, actin, MCK, alpha globin, adenovirus and dihydrofolate reductase. 

In therapy or as a prophylactic, the active agent i.e., the polypeptide, polynucleotide 
or inhibitor of the invention, may be administered to a patient as an injectable composition, 
for example as a sterile aqueous dispersion, preferably isotonic. 

Alternatively the composition may be formulated for topical application 

30 for example in the form of ointments, creams, lotions, eye ointments, eye drops, ear drops, 
mouthwash, impregnated dressings and sutures and aerosols, and may contain appropriate 
conventional additives, including, for example, preservatives, solvents to assist drug 
penetration, and emollients in ointments and creams. Such topical formulations may also 
contain compatible conventional carriers, for example cream or ointment bases, and ethanol 
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or oleyl alcohol for lotions. Such carriers may constitute from about 1 % to about 98% by 
weight of the formulation; more usually they will constitute up to about 80% by weight of 
the formulation. 

For administration to human patients, it is expected that the daily dosage level of 
the active agent will be from 0.01 to 10 mg/kg, typically around 1 mg/kg. The physician in 
any event will determine the actual dosage which will be most suitable for an individual 
patient and will vary with the age, weight and response of the particular patient. The above 
dosages are exemplary of the average case. There can, of course, be individual instances 
where higher or lower dosage ranges are merited, and such are within the scope of this 
invention. 

A vaccine composition is conveniently in injectable form. Conventional adjuvants 
may be employed to enhance the immune response. 

A suitable unit dose for vaccination is 0.5-5 ug/kg of antigen, and such dose is 
preferably administered 1-3 times and with an interval of 1-3 weeks. 

Within the indicated dosage range, no adverse toxicologicals effects are expected 
with the compounds of the invention which would preclude their administration to suitable 
patients. 

The individual full length sequences given herein are summarized in the following 
Tables (Table 1 and Table 2). Under the column in Table 1 labeled "Identity" there is the 
deduced identity of each open reading frame of the invention determined using Blastp 
and/or MPSearch. The ORF# column indicates whether the polynucleotide encoding each 
ORF encodes more than one ORF. For example. SEQ ID NO: 263 has an ORF# 1 but no 
#2; thus the polynucleotide encodes this ORF#l , but no other ORF was detected. On the 
other hand. SEQ ID NO:286 and 287 have ORF#s 1 and 2 respectively, indicating that they 
were both encoded by the same polynucleotide, this can also be seen in Table 2 where the 
polynucleotide of SEQ ID NO: 24 encodes the ORFs of SEQ ID NOS: 286 and 287. Table 
1 also shows in the position ("POSITION" columns) the start ("START" column) and stop 
("STOP" column) codons for each ORF and their positions in the encoding polynucleotide 
sequence. The SEQ ID NOS of the polypeptides of this table are linked to both a deduced 
identity in this table and a polynucleotide sequence in Table 2 which encodes each 
polypeptide. The "Direction" column in Table 1 shows the direction of the ORF encoding 
each poypetide in this table. "Forward" denotes the sense orientation and "Reverse" 
denotes the antisense orientation of the ORF. Table 1 also provides an "Assembly ID" 
which is a unique numerical descriptor for each polynucleotide sequence of the invention. 
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IABLEJL 











Codon 




Position 






SEQID 
NO: 


IDENTITY 


Assembly 
ID 


ORF# 


Start 


Stop 


Start 


Stop 


Direction 


263. 


probable 
transposase 
(insertion 
sequence IS861) 
- Streptococcus 
agalactiae (strain 
COH-1) 


3112224 


1 


ATG 


TAA 


258 


746 


Forward 


264. 


Unknown 


3112506 


1 


-CAT 


CTA- 


414 


554 


Reverse 


265. 


SUCCINYL- 

DIAMINOPIM 

ELATE 

DESUCCINYL 
ASE (EC 
3.5.1.18) 
(SDAP). - 
ESCHERICHIA 
COLI . 


3112574 


1 


-CAT 


TTA- 


607 


990 


Reverse 


266. 


Unknown 


3112646 


1 


ATG 


TGA 


353 


460 


'orward 


267. 


)- alanine 
permease 
[dagA) homolog 
- Haemophilus 
influenzae 
[strain Rd 
KW20) 


31 126R6 


i 
i 




1 AA 


393 


1235 


forward 
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Codon 




Position 






SEQID 
NO: 


IDENTITY 


Assembly 
ID 


ORF# 


Start 


Stop 


Start 


Stop 


Direction 


268. 


Unknown 


3112810 


1 


-CAT 


TTA- 


328 


468 


Reverse 


269. 


Unknown 


3112934 


1 


ATG 


TAG 


107 


739 


Forward 


270. 


3-OXOACYL- 

[ACYL- 

CARRJER 

PROTEIN] 

REDUCTASE 

PRECURSOR 

(EC 1.1.1.100) 

(3-KETOACYL 

-ACYL 

CARRIER 

PROTEIN 

REDUCTASE). 

-CUPHEA 

LANCEOLATA 


3112956 


1 


ATG 


TGA 


586 


1146 


Forward 


271. 


HIGH- 
AFFINITY 
BRANCHED- 
CHAIN AMINO 
ACID 

TRANSPORT 
ATP-BDMDING 
PROTEIN 
LIVG (LIV -I 
PROTEEMG). - 
SALMONELL A 

rYPHiMURiu 

M. 


3112994 


1 


ATG 


TAA 


307 


648 


Forward 


272. 


Jnknown 


3113026 


1 




rGA : 


>35 i 


183 


? orward 
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iCodon 



iPosition 



ISEQ ID 
NO: 



IDENTITY 



273. 



274. 



275. 



Assembly |ORF#|Start 
ID 



Stop Start 



Stop 



Direction 



Unknown 



3113098 1 



-CAT 



CELL 

DIVISION 

PROTEIN 

FTSH. - 

BACILLUS 

SUBTILIS. 



TTA~ 1 28 



3113274 1 



360 



ATG 



Reverse 



TGA 125 



700 



Forward 



I Unknown 



3113306 1 



-CAT 



CTA- 255 



509 



Reverse 



PTS SYSTEM, 
FRUCTOSE- 
SPECIFIC IIBC 
COMPONENT 
(EHBC-FRU) 
(FRUCTOSE 
PERMEASE H 
BC 

COMPONENT) 
(PHOSPHOTR 
ANSFERASE 
ENZYME II, 
BC 

COMPONENT) 
(EC 2.7. 1.69) 
(EH -FRU). - 
XANTHOMON 
AS 

CAMPESTRIS 
(PV. 

CAMPESTRIS) 



3113406 1 



-CAT 



TCA- 237 



725 



Reverse 



277. 
278. 



I Unknown 



113432 II 



-CAT 



CTA- 17 



310 



Reverse 



lUnknown 



3113436 1 



CTG 



TAA 1 



441 



'orward 
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Codon 




Position 






SEQID 
NO: 


IDENTITY 


Assembly 
ID 


ORF# 


Start 


Stop 


Start 


Stop 


Direction 


279. 


HOLLIDAY 
JUNCTION 

pVVT A 

DNA 

ricLICASE 
KUVB. - 
ESCHERICHIA 
COLI. 


3113510 


1 


ATG 


TGA 


187 


411 


Forward 


280. 


Unknown 


3113514 


1 


ATG 


TGA 


361 


49^ 


Forward I 


281. 


Unknown 


3113546 


1 


TTG 


TAA 


2 


241 


Forward 1 


282. 


CELL 

DIVISION 

PROTEIN 

FTSA. - 

BACILLUS 

SUBTELIS. 


3113610 


1 


TTG 


TGA 


3 


665 


Forward 1 


283. 


Unknown 


3113692 




ATG 


TAG 


304 


594 


"orward 1 


284. 


GALACTOSE- 
6-PHOSPHATE 
ISOMERASE 
LACB 

SUBUNIT (EC 
5.-.-.-). . 

LACTOCOCCU 
S LACTIS 
[SUBSP. 
LACTIS) 
[STREPTOCOC 
CUS LACTIS). 


3113696 




ATG 


TGA 


176 


661 


'orward 1 
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Codon 




Position 






SEQID 
NO: 


IDENTITY 


Assembly 
ID 


ORF# 


Start 


Stop 


Start 


Stop 


Direction 


285. 


3- 

ISOPROPYLM 
ALATE 
DEHYDROGE 
NASE (EC 
1.1.1.85) 
(BETA-IPM 
DEHYDROGE 
NASE) (IMDH). 

LACTOCOCCU 
o LACTIS 
(SUBSP. 
LACTIS) 
(STREPTOCOC 
CUS LACTIS). 


3113762 


1 


~CAG 


TTA~ 


275 


508 


Reverse 
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Codon 




Position 






SEQ ID 
NO: 


IDENTITY 


Assembly 
ID 


ORF# 


Start 


Stop 


Start 


Stop 


Direction 


286. 


PHOSPHO-2- 

DEHYDRO-3- 

DEOXYHEPTO 

NATE 

ALDOLASE 

(EC 4.1.2.15) 

(PHOSPHO- 2- 

KETO-3-DE 

OXYHEPTON 

ATE 

ALDOLASE) 
(DAHP 

SYNTHETASE) 

(3-DEOXY- D- 

ARABINO- 

HEPTULOSON 

ATE 7- 

PHOSPHATE 

SYNTHASE). - 

CORYNEBACT 

ERIUM 

GLUTAMICU 

M. 


3113794 


1 


ATG 


TAG 


40 


219 


Forward 
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Codon 



Position 



ISEQID 
NO: 



IDENTITY 



Assembly |ORF# 
ID 



Start Stop 



Start 



Stop 



Direction 



287. 



PHOSPHO-2- 
DEHYDRO-3- 
DEOXYHEPTO 
NATE 

ALDOLASE, 
PHE- 

SENSITIVE 
(EC 4. 1.2. 15) 
(PHOSP HO-2 
KETO-3- 
DEOXYHEPTO 
NATE 

ALDOLASE) 
(DAHP 
SYNTHETASE) 
(3-DEOXY-D- 
ARABINO-HEP 
TULOSONATE 
7-PHOSPHATE 
SYNTHASE). - 
ESCHERICHIA 
COLI. 



3113794 2 



ATG TGA 



283 



453 



Forward 



PYRUVATE 3113802 |1 
KINASE (EC 
.7.1.40).- 
LACTOCOCCU 

LACTIS 
(SUBSR. 
LACTIS) 
(STREPTOCOC 
CUS LACnS). 



ATG TGA 



69 



260 



Forward 



nknown 



3113990 1 



-CAT ITTA- 



164 



Reverse 
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Codon 




Position 






SEQID 
NO: 


IDENTITY 


Assembly 
ID 


ORF# 


Start 


Stop 


Start 


Stop 


Direction 


290. 


ADAPTIVE- 
RESPONSE 
SENSORY- 
KIN ASE SASA 
(EC 2.7.-.-). - 
SYNECHOCOC 
CUS SP. 
(STRAIN PCC 
7942) 

(ANACYSTIS 

NIDULANS 

R2). 


3114082 


1 


ATG 


TAG 


22 


189 


Forward 


291. 


FOLYLPOLYG 

LUTAMATE 

SYNTHASE 

(EC 6.3.2.17) 

(FOLYLPOLY- 

GAMMA- 

GLUTAMATE 

SYNTHETASE 

) (FPGS). - 

LACTOBACIL 

LUS CASEI. 


3114096 


1 


CTG 


TGA 


1 


243 


Forward 


292. 


50S 

RIBOSOMAL 
PROTEIN L6 
(BL10). - 
BACILLUS 
STEAROTHER 
MOPHILUS. 


3174146 


1 


-CAT 


TCA- 


581 


949 


Reverse 
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Codon 



Position 



SEQ ID 

NO: 



IDENTITY 



Assembly 
ID 



ORF 



Start 



Stop 



Start 



Stop Direction 



grpE protein 
Lactococcus 
lactis 



3174148 



-CAT 



TTA- 



256 



780 Reverse 



298. 



Unknown 
ribosomaJ 
protein S14 
(rpS14) 
tiomolog - 
Haemophilus 
influenzae 
(strain Rd 
KW20) 



3174150 
3174152 



CAT 
CAT 



TCA 
TTA- 



347 
137 



472 Reverse 
391 Reverse 



50S (3174154 
RIBOSOMAL 
PROTEIN L3. - 
BACILLUS 
STEAROTHER 
MOPHILUS. 



PHOSPHATE |3174166 
TRANSPORT 
SYSTEM 
REGULATORYl 
PROTEIN. - 

ESCHERICHIA I 
COLI. 



ATG 



PROBABLE 
TRANSKETOL 
ASE (EC 
2.2.1.1) (TK).- 
STREPTOCOC 
CUS 

PNEUMONIAE 



3174184 



CAT 



-CAT 



TAA 



441 



TTA- 



TTA- 



101 



895 



1067 I Forward 



751 Reverse 



350 Reverse 
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Codon 




Position 






SEQID 
NO: 


IDENTITY 


Assembly 
ID 


ORF# 


Start 


Stop 


Start 


Stop 


Direction 


299. 


ntpJ protein - 
Enterococcus 
hirae 


3174206 


1 


TTG 


TAG 


2 


763 


Forward 


300. 


Unknown 


3174208 


1 


ATG 


TAA 


311 


1054 


Forward 


301. 


Unknown 


3174210 


1 


-CAT 


TTA~ 


290 


2305 


Reverse 


302. 


ACYL 
CARRIER 
PROTEIN. - 
CRYPTOMON 
AS PHI. 


3174220 


1 


ATG 


TAG 


788 


1021 


Forward 


303. 


ISL2 protein - 

Lactobacillus 

helveticus 


3174224 


1 


-CAT 


TTA- 


447 


737 


Reverse 


304. 


ISL2 protein - 

Lactobacillus 

helveticus 


3174226 


1 


ATG 


TAG 


850 


1164 


'. "orward 


305. 


Unknown 


3174228 


1 


CTG 


TAA 




240 


Forward 


306. 


Unknown 


3174228 


2 


ATG 


TAG 


278 


580 


Forward 


307. 


Possible Ca2+- 

transporting 

ATPase 


3174236 


1 


-CAT 


TTA- 


53 


241 


Reverse 


308. 


Possible Ca2+- 

transporting 

ATPase 


3174238 




-CAT 


TTA- 


53 


529 


Reverse 
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Codon 




Position 






SEQID 
NO: 


IDENTITY 


Assembly 
ID 


ORF# 


Start 


Stop 


Start 


Stop 


Direction 


313. 


PROLYL- 
TRNA 

SYNTHETASE 

(EC 6.1.1.15) 

(PROLINE— 

TRNA 

LIGASE) 

(PRORS) 

(GLOBAL RNA 

SYNTHESIS 

FACTOR). - 

ESCHERICHIA 

COLI. 


3174284 


1 


ATG 


TAA 


1009 


2103 


Forward 


314. 


BSCELABCD 
NCBI gi: 8957 - 
Bacillus subtilis. 


3174288 


1 


ATG 


TAA 


78 


464 


Forward 


315. 


GLUT AMINE 

TRANSPORT 

ATP-BINDING 

PROTEIN 

GLNQ. - 

ESCHERICHIA 

COLI. 


3174294 




ATG 


TGA 


370 


981 


Forward 


316. 


Unknown 


3174294 


2 


ATG 


TAA 


1044 


1217 


; orward 


317. 


GLUT AMINE 
TRANSPORT 
ATP-BINDING 
PROTEIN 

BACILLUS 

STEAROTHER 

MOPHILUS. 


3174298 




ATG 


TAA 


431 


976 


'orward 
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Codon 



JPosition 



ISEQID 
NO: 



IDENTITY 



Assembly |ORF# 
ID 



Start 



Stop Start Stop 



Direction 



318. 



319. 

Eo7 



GLUTAMINE |3 174298 |2 
TRANSPORT 
ATP-BINDING 
PROTEIN 
GLNQ. - 
BACILLUS 
STEAROTHER 
MOPHILUS. 



ATG 



TAA 922 



1179 



Forward 



Unknown 



13174302 1 



-CAT 



TTA- 322 



1320 



leverse 



Unknown 



13174302 2 



-CAT 



CTA- 1486 



1857 



leverse 



321. 



322. 
323. 



Unknown 



13174314 1 



Unknown 



ATG_TGAl523 



13174314 2 



ATG 



891 



forward 



TAG 1888 



1370 



forward 



RIBOFLAVIN 
SYNTHASE 
ALPHA CHAIN | 
EC 2.5.1.9).- 
BACILLUS 
SUBTELIS. 



3174328 1 



-CAT 



CTA- 213 



653 



Reverse 



324. 



5,10- 
METHYLENETl 

etrahydrofI 

OLATE 
REDUCTASE 
(EC 1.7.99.5).- 
SALMONELLAl 
TYPHIMURIU 
M. 



3174342 1 



ATG 



TAA 372 



1238 



toward 



325. 



326. 



Unknown 



13174356 1 



CAA 



LSSAKACNP |3174368 |1 
NCBI gi: 
695615 - 
Lactobacillus 
sake. 



CA- 584 



1342 



Reverse 



ATG 



AA 219 



539 



Forward 
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Codon 




Position 






SEQID 
NO: 


IDENTITY 


Assembly 
ID 


ORF# 


Start 


Stop 


Start 


Stop 


Direction 


327. 


agrB protein - 

Staphylococcus 

aureus 


3174368 


2 


ATG 


TGA 


730 


1056 


Forward 


328. 


RESD 
PROTEIN. - 
BACILLUS 
SUBTILIS. 


3174372 


1 


CTG 


TGA 


1 


315 


Forward 


329. 


PROBABLE 

TRANSCRIPTI 

ONAL 

REGULATORY 

PROTEIN 

ENDR.- 

BACILLUS 

POLYMYXA. 


3174384 


1 


-CAT 


TCA- 


239 


418 


Reverse 


330. 


Jnknown 


3174384 


2 


-CAT 


CTA- 


514 


1074 


Reverse 


331. 


30S 

RIBOSOMAL 
PROTEIN S2. - 
ESCHERICHIA 
COLI. 


3174390 




ATG 


TAA 


597 


1457 


T orward 


332. 

I 


GLUT AMINE 
TRANSPORT 

A TP-Tl TMniwr: 
f\ I r -DxiHmJmIHkj 

PROTEST 
GLNQ. - 
ESCHERICHIA 
COLI. 


3174402 




ATG 


TGA 


372 


980 


: orward 

1 
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MCodon 



Position 



SEQID 
NO: 



IDENTITY (Assembly |ORF#|Start Stop 
I ID 



Start 



Stop 



Direction 



333. 



334. 



335. 



336. 



GTP-BINDING |3174420 
PROTEIN ERA 
HOMOLOG. 
STREPTOCOC 
CUS MUTANS. 



1 



-CAT 



TTA- 



592 



1086 



Reverse 



GTP-BINDING |3 174420 
PROTEIN ERA 
KOMOLOG. 
STREPTOCOC 
CUS MUTANS. I 



-CAT 



TTA- 



1022 



1492 



Reverse 



SULFATE 
TRANSPORT 
ATP-BINDING 
PROTEIN 
CYSA. - 
SYNECHOCOCl 
CUS SP. 
(STRAIN PCC 
794 2) 

(ANACYSTIS 

NIDULANS 

R2). 



3174426 1 



IATG 



TGA 



812 



1270 



-orward 



NITRATE |3174426 |2 
TRANSPORT 
PROTEIN 
NASD. 

KLEBSIELLA 
PNEUMONIAE I 



ATG 



TAG 



1298 



1543 



forward 
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Codon 




Position 






SEQID 
NO: 


IDENTITY 


Assembly OI 
ID 


IF # Start 


Stop 


Start 


Stop 


Direction 


337. 


PROLIPOPROl 
EIN 

DIACYLGLYC 
ERYL 

TRANSFERAS 
E (EC 2.4.99.-). 
- 

SALMONELLA 

TYPHIMURI 

UM. 


' 3174428 1 


-CAT 


TTA- 


35 


835 


Reverse 


338. 


Unknown 


3174444 1 


ATG 


TAA 


164 


895 


Forward 


339. 


CYSTEINYL- 
TRNA 

SYNTHETASE 
(EC 6.1.1.16) 
(CYSTEINE— 
TRNA 
LIGASE) 
(CYSRS). - 
BACI LLUS 
SUBTILIS. 


3174454 1 


-CAT 


TCA- 


504 


1079 


Reverse 


340. 


Unknown 


3174460 1 


ATG 


TAG 


529 


1275 


Forward 


341. 

< 


PRIMOSOMAL 
PROTEIN N' 

N FACTOR Y). 

ESCHERICHIA 
COLI. 


3174462 1 


TTG 


tga : 


2 


721 


; orward 
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Codon 




Position 






SEQ ID 

NO: 


IDENTITY 


Assembly 
ID 


ORF« 


! Start 


Stop 


Start 


Stop 


Direction 1 


342. 


30S 

RD30SOMAL 
PROTEIN SI 1 
(BS11).- 
BACILLUS 
SUBTELIS. 


3174466 


1 


-CAT 


TTA- 


1019 


1303 


Reverse 


343. 


Unknown 


3174474 


1 


-CAT 


TTA- 


238 


473 


Reverse 


344. 


Unknown 


3174476 


1 


-CAT 


TTA- 


241 


1446 


Reverse 


345. 


PEPTIDE 

CHAIN 

RELEASE 

FACTOR 3 

(RF-3). - 

BACTEROIDE 

S NODOSUS 

(DICHELOBAC 

TERNOD 

OSUS). 


3174490 


1 


-CAT 


CTA- 


668 


1291 


Reverse 1 


346. 


mesl protein - 
^euconostoc 
mesenteroides 


3174496 




ATG 


TAG 


812 


1666 


Forward J 


347. 


Unknown 


3174506 




ATG 


TGA 


179 


352 


Forward 1 


348. 

] 

] 


GALACTOKIN 
ASE (EC 
2.7.1.6).- 
LACTOBACIL 
LUS 

HELVETICUS. 


3174514 




-CAT ' 


TCA- 


209 


340 


Reverse 1 


349 i 

4 

] 
J 


JALAL 1 VJiSjJN „ 

\SE (EC 
i.7.1.6). - 
BACILLUS 
JUBTILIS. 


> 1745 14 i 


I 


-CAT ' 


[TA- 


1177 


1359 ] 


Reverse 
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Codon 




Position 




SEQID 
NO: 


IDENTITY 


Assembly 
ID 


ORF4 


1 Start 


Stop 


Start 


Stop [Direction 


350. 


FORMATE 
ACETYLTRAN 
SFERASE 1 
(EC 2.3.1.54) 
(PYRUVATE 
FORMATE- 
LYASE 1). - 
ESCHERI 
CHIA COLI. 


3174524 


1 


-CAT 


TTA- 


19 


867 (Reverse 


351. 


Unknown 


3174546 


1 


-CAT 


CTA- 


912 


1127 [Reverse 


352. 


CACSPCNCBI 
gi: 899232 - 
Clostridium 
acetobutylicum. 


3174550 


1 


ATG 


TAG 


645 


956 (Forward 


353. 


Unknown 


3174562 




CTG 


TAG 




504 Forward 


354. 


KETOACYL 

REDUCTASE 

HETN(EC 

1.3.1.-).- 

ANABAENA 

SP. (STRAIN 

PCC 7120). 


3174562 


2 


ATG 


TAA 


525 


1280 Forward 


355. 


Unknown 


3174570 




ATG 


rAG 


249 


593 [Forward 


356. 


ATP- 

DEPENDENT 
DNA 

RECG (EC 
J.6.I.-).- 

escherichia 
:oli. 


3174570 


2 


ATG ' 


fGA I 


504 


695 Forward 
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Codon 




Position 






SEQID 
NO: 


IDENTITY 


Assembly 
ID 


ORF# 


Start 


Stop 


Start 


Stop 


Direction 


357. 


PROTEIN 
DLTD 

PRECURSOR. - 

BACILLUS 

SUBTILIS. 


3174580 


1 


ATG 


TGA 


3 


611 


Forward 


358. 


ALANYL- 
TRNA 

SYNTHETASE 
(EC 6.1.1.7) 
(ALANINE- 
TRNA 
LIGASE) 
(ALARS). - 
ESCHERICH 
IA COLL 


3174582 


1 


ATG 


TGA 


537 


875 


Forward 


359. 

d 

1 


PTS SYSTEM, 

MANNOSE- 

SPECIFIC nc 

COMPONENT 

(EHC-MAN) 

(MANNOSE- 

PERMEASE 

ncco 

MPONENT) 
PHOSPHOTR 
ANSFERASE 
ENZYME H, C 
-OMPONENT) 
EII-P-MAN). - 
ESCHERICH 
[A COLI. 


3174586 




ATG 


TGA 


533 


865 


■■orward 
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Codon 




Position 






SEQID 
NO: 


IDENTITY 


Assembly 
ID 


ORFfl 


Start 


Stop 


Start 


Stop 


Direction 


360. 


PTS SYSTEM, 
FRUCTOSE- 
SPECIFIC DC 
COMPONENT 
(ETC-FRU) 
(FRUCTOSE- 
PERMEASE 

nc 

COMPONENT) 
(PHOSPHOTR 
ANSFERASE 
ENZYME n, C 
COMPONENT) 
(P28). - 
BACILLUS 
SUBT ELIS. 


3174586 


2 


ATG 


TAA 


747 


1172 


Forward 


361. 


URIDYLATE 
KINASE (EC 
2.7.4.-) (UK) 
(URIDINE 
MONOPHOSP 
HATE 

KINASE) (UMP 

KINASE) 

(SMBA 

PROTEIN). - 

ESCHERICHIA 

COLL 


3174594 




-CAT 


TTA- 


319 


579 


Reverse 


362. 


Jnknown " ; 


J 174606 


1 l 


CTG ' 


rAA 


1 


153 


: orward 


363. 


Jnknown 


i 174608 


i 


\TG ' 


rAA : 


536 


!63 


; orward 


364. 


Jnknown '. 


J 174634 




-CAT 


7TA- ! 


>9 


$08 


Reverse 


365. 


Jnknown ; 


) 174634 


I 


-CAT 


rcA~ : 


J05 


J26 


Reverse 



47 



WO 97/43303 



PCT/US97/07950 











Codon 




Position 






SEQID 
NO: 


UIUJNTITY 


Assembly 
ID 


ORF# 


Start 


Stop 


Start 


Stop 


Direction 


366. 


5- 

METHYLTETR 

AHYDROPTER 

OYLTRIGLUT 

AMATE— 

HOMOCYSTEI 

NE 

METHYLTRA 

NSFERASE 

(EC 2.1.1 .14) 

(METHIONINE 

SYNTHASE, 

VTTAMIN-B12 

INDEPENDEN 

T ISOZYME). - 

ESCHERICHIA 

CO LI. 


3174642 


1 


-CAT 


TCA- 


250 


1620 


Reverse 


367. 


HIGH- 
AFFINITY 
BRANCHED- 
CHAIN AMINO 
ACID 

TRANSPORT 

ATP-BINDING 

PROTEIN 

BRAF.-P 

SEUDOMONA 

S 

AERUGINOSA. 


3174644 


1 


ATG 


TGA 


283 


765 


Forward 


368. 1 


Unknown 


3174652 




ATG 


rGA < 


m 


1134 


Forward 


369. i 


Unknown 


3174654 




ATG ' 


fGA 


158 


319 


Forward 
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Codon | 


Position 




ISEQ ID 1 IDENTITY 
NO: 


Assembly |ORF# 
ID 


Start 


(Stop 


Start 


Stop Direction 


370. 


ADENYLOSUC 
CINATE 
LYASE (EC 
4.3.2.2) 

(ADENYLOSU 
CCTNASE) 
(ASL). - 
BACILLUS 
SUBTIL IS. 


: 3174658 1 




-CAT 


TTA- 


941 


1477 Reverse 


371. 


[Unknown 


3174660 1 


Iatg 


tga 


454 


594 [Forward I 


372. 


1 Unknown 


3174662 1 




-CAT 


TCA- 


103 


768 [Reverse | 


373. 


50S 

Iribosomal 


3174664 ll 


Iatg 


IT A A 


33 


446 Forward 1 




PROTEIN L16 
















MYCOPLASM 
A 
















CAPRICOLUM. 
















p 1 AGE V 
SPORULATIO 


3174666 1 


Iatg 


Itag 


151 


316 Forward j 




N PROTEIN E. 
















- BACILLUS 
















SUBTILIS. 














375. 
376. 


Unknown 2 

Unknown 2 


t 174676 1 ~ 
(174678 Jl 


-cat 

-CAT 


cta- : 

TCA- 1 


!4 2 
2 


159 iReverse 1 
107 I Reverse 


377. 


1 Unknown 


H74714 ll 


Iatg 


TAA |8 


39 


363 1 Forward | 
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Codon 




Position 






SEQID 
NO: 


IDENTITY 


Assembly 
ID 


ORF# 


Start 


Stop 


Start 


Stop 


Direction 


378. 


ATP- 

DEPENDENT 
CLP 

PROTEASE 

ATP-BINDING 

SUBUNIT 

CLPX. - 

ESCHERICHIA 

COLL 


3174736 


1 


CTG 


TGA 


3 


317 


Forward 


379. 


ATP- 

DEPENDENT 
CLP 

PROTEASE 

ATP-BINDING 

SUBUNIT 

CLPX. - 

ESCHERICHIA 

COLL 


3174736 


2 


ATG 


TAG 


411 


797 


Forward 


380. 


Unknown 


3174738 


1 


ATG 


TAA 


184 


558 


Forward 


381. 


Unknown 


3174744 


1 


ATG 


TAA 


507 


1082 


Forward 


382. 


PYRROLIDON 
E- 

CARBOXYLAT 
E PEPTIDASE 
(EC 3.4. 19.3) 
(5- 

OXOPROLYL- 
PEPTIDASE). - 
STR 

EPTOCOCCUS 
PYOGENES. 


3174748 


1 


ATG 


TAA 


218 


691 


Forward 


383. 


Unknown 


3174748 


2 


ATG 


TGA 


693 


875 


Forward 


384. 


Unknown 


3174760 


1 


ATG 


TGA 


280 


495 


Forward 
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Codon 




Position 






SEQID 
NO: 


IDENTITY 


Assembly 
ID 


ORFfl 


Start 


Stop 


Start 


Stop 


Direction 


IOC 

385. 


Unknown 


3174770 


1 


ATG 


TGA 


226 


402 


Forward 


386. 


RECOMBDMAT 
ION PROTEIN. 
- BACILLUS 
SUBTILIS. 


3174772 


1 


ATG 


TAA 


898 


1527 


Forward 


387. 


ALPHA- 

ACETOLACTA 

TE 

DECARBOXYL 
ASE (EC 
4.1.1.5).- 
ENTEROBACT 
ER 

AEROGENES 

(AEROBA 

CTER 

AEROGENES). 


3174774 


1 


ATG 


TAG 


155 


550 


Forward 


388. 

< 


3- 

ISOPROPYLM 
ALATE 

DEHYDRATAS 

E(EC 4.2.1.33) 

(ISOPROPYLM 

ALATE 

ISOMERASE) 

(ALPHA- IPM 

ISOMERASE). 

LACTOCOCCU 
S LACTTS 
[SUBSP. 
LACTIS) 
STREPTOCOC 

:us LAcns). 


3174784 




ATG 


TAG 


291 


650 


"orward 
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Codon 




Position 






SEQID 
NO: 


IDENTITY 


Assembly 

n> 


ORF# 


Start 


Stop 


Start 


Stop 


Direction 


389. 


peptide chain 
release factor 1 - 
Bacillus subtilis 


3174802 


1 


ATG 


TAA 


293 


1372 


Forward 


390. 


ATP-dependent 
Clp proteinase 
(EC 3.4.21.-) 
chain clpL - 
Lactococcus 
lactissubs p. 
lactis plasmid 
pUCL22 


3174806 


1 


-CAT 


CTA- 


592 


1194 


Reverse 


391. 


Unknown 


3174812 




ATG 


TAA 


55 


933 


"orward 


392. 


FAD synthase 
(EC 6.3.-.-)- 
Corynebacteriu 
m 

ammoniagenes 


3174818 




-CAT 


TTA- 


4 


921 


Reverse 


393. 

] 


CYTIDINE 

DEAMINASE 

(EC 3.5.4.5) 

(CYTIDINE 

AMINOHYDR 

OLASE) 

\*LJ/\ ). - 

BACILLUS 
SUB TILIS. 


3174826 




ATG 


TAG 


25 


414 


; orward 


394. 1 


Unknown : 


J 174832 




-CAT ' 


rr A- : 


13 : 


582 


Reverse 


395. 1 


Unknown : 


5174838 




ATG ' 


fAA 


184 : 


)40 1 Forward 
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Codon 



Position 



ISEQID 
NO: 



IDENTITY 



Assembly 
ID 



ORF# 



Start Stop 



Start 



Stop Direction 



SARPLRPO 
NCBI gi: 
677848NCBI gi 
473748 - 
Staphylococcus 
aureus. 



3174842 



-CAT TCA- 



299 



574 Reverse 



Unknown 



3174852 



-CAT TCA- 



TRK SYSTEM 

POTASSIUM 

UPTAKE 

PROTEIN 

TRKA 

HOMOLOG. - 
METHANOSA 
RCINA MAZE! 



181 



3174858 



CTG TGA 



540 | Reverse 



1077 Forward 



Unknown 



3174870 



-CAT ICTA- 



228 



953 I Reverse 



6-PHOSPHO- 
BETA- 

GALACTOSID 
ASE (EC 
2.1.85) 
(BETA-D- 
PHOSPHOGAL 
ACTOSIDE 
GALACTOHY 
DROLASE). - 
LACTOGOCCU 
SLACTIS 
(SUBSP. 
LACTIS) 
(STREPTOCOC 
CUS LACTIS) 



3174878 



-CAT TCA- 



227 



667 Reverse 



385. 



hemolytic factor 
Bacillus cereus 



3174926 



-CAT TCA- 



591 



1142 



Reverse 
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Codon 




Position 






SEQID 
NO: 


IDENTITY 


Assembly 
ID 


ORF# 


Start 


Stop 


Start 


Stop 


Direction 


386. 


DIPEPTIDYL 

PEPTIDASE IV 

(EC 3.4.14.5) 

(X-PROLYL 

DIPEPTIDYL 

AMINOPEPTID 

ASE IV) ( X- 

PDAP). - 

LACTOCOCCU 

S LACTIS 

(SUBSP. 

CREMORIS) 

(STREPTOCOC 

CUS 

CREMORIS). 


3174936 


1 


ATG 


TAA 


189 


479 


Forward 


387. 


ARGININE 
HYDROXIMAT 

RESISTANCE 
PROTEIN. - 
BACILLUS 
SUBTILIS. 


3174936 


2 


ATG 


TGA 


496 


846 


Forward 


388. 


Unknown 


3174938 




ATG 


TGA 


366 


587 


*orward 


389. 


Unknown 


3174946 




-CAT 


CTA- 


111 


380 


leverse 


390. 


Unknown 


3174952 




-CAT 


TTA- : 


37 


B19 


Reverse 


391. 


DnaK- protein - 

Lactococcus 

lactis 


3174970 




CTG 


TGA : 


3 : 


380 


Forward 


392. 


Unknown 


3174990 




ATG ' 


rAA 


141 


1058 


Forward 
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Codon 




Position 






SEQID 
NO: 


IDENTITY 


Assembly 
ID 


ORF# 


Start 


Stop 


Start 


Stop 


Direction 


409. 


LACALS NCBI 
gi: 473900- 
Lactococcus 
lactis (strain 
DSM 20384, 
sub-species lact 
is) DNA. 


3175000 


1 


ATG 


TAA 


794 


1015 


Forward 


410. 


Unknown 


3175006 


1 


ATG 


TGA 


56 


631 


Forward 


411. 


cellobiose 
phosphotransfer 
ase system celA 
- Bacillus 
stearothermophil 
us 


3175006 


2 


ATG 


TAA 


646 


963 


Forward 


412. 


Unknown 


3175010 


1 


-CAT 


TTA- 


13 


231 


Reverse 


413. 


Unknown 


3175014 


1 


ATG 


TGA 


58 


219 


Forward 


*f l*t. 


Uiuuiown 


3175016 


1 


ATG 


TAA 


120 


503 


Forward 


415. 


Unknown 


3175032 


1 


ATG 


TAA 


364 


669 


forward 


416. 


Unknown 


3175046 


1 


ATG 


TGA 


105 


401 ] 


Forward 
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Codon 




Position 






SEQIO 
NO: 


IDENTITY 


Assembly 
ID 


ORF# 


Start 


Stop 


Start 


Stop 


Direction 


417. 


GLYCYL- 
TRNA 

SYNTHETASE 
ALPHA CHAIN 
(EC 6.1.1.14) 
(GLYCINE— 
TRNA LIGASE 
ALPHA CH 
AIN) (GLYRS). 

ESCHERICHIA 
COLL 


3175074 


1 


-CAT 


CTA- 


107 


787 


Reverse 


418. 


GLYCYL- 
TRNA 

SYNTHETASE 
ALPHA CHAIN 
(EC 6.1.1.14) 
(GLYCINE— 
TRNA LIGASE 
ALPHA CH 
AIN) (GLYRS). 

ESCHERICHIA 
COLI. 


3175074 


2 


-CAT 


TCA- 


787 


936 


Reverse 


419. 

] 
1 


endopeptidase 
PepO (EC 3.4.-.- 
) - Lactococcus 
actis subsp. 
lactis 


3175092 




-CAT ' 


TTA- - 


47 


490 


Reverse 


420. ] 
] 
1 
1 


BSCELABCD '. 
^CBI gi: 
395746- 
3acillus subtilis. 


J 175094 




-CAT < 


ZTA~ : 


03 


019 


leverse 
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Codon 




Position 






SEQ ID 
NO: 


IDENTITY 


Assembly 
ID 


ORF# 


Start 


Stop 


Start 


Stop 


Direction 


421. 


DNA LIGASE 
(EC 6.5.1.2) 
(POLYDEOXY 
RIBONUCLEO 
TIDE 

SYNTHASE 
(NAD+)). - 
ESGHERICHI 
A COLL 


3175098 


1 


ATG 


TAA 


134 


265 


Forward 


422. 


Unknown 


3175098 


2 


ATG 


TAA 


210 


446 


Forward 


423. 


TAGATOSE 
1.6- 

DIPHOSPHAT 
E ALDOLASE 
(EC 4.1.-.-). - 
LACTOCOCCU 
SLACTIS 
(SUBSP. LA 

ens) 

(STREPTOCOC 
CUS LACTIS). 


3175102 


1 


ATG 


TGA 


495 


950 


Forward 


424. 


Unknown 


3175104 




CTG 


TAA 




525 


J orward 


425. 


Unknown 


3175114 




-CAT 


TTA- 


59 


508 


leverse 


426. 


Unknown 


3175126 




ATG 


TAA 


3 


203 


; orward 


427. 


Unknown 


3175136 




ATG ' 


TOA 


370 


582 


"orward 


428. 


Unknown 


3175138 




-CAG " 


rcA- 


1134 


1448 


Reverse 
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Codon 




Position 






SEQ ID 

NO: 


IDENTITY 


Assembly 
ID 


ORF# 


Start 


Stop 


Start 


Stop 


Direction 


429. 


2- 

ISOPROPYLM 

ALATE 

SYNTHASE 

(EC 4.1.3.12) 

(ALPHA- 

ISOPROPYLM 

ALATE 

SYNTHASE) 

(ALPH A-IPM 

SYNTHETASE) 

LACTOCOCCU 

SLACTIS 

(SUBSP. 

LACTIS) 

(STREPTOCOC 

CUS LACTTS 


3175140 


1 


ATG 


TAA 


963 


1382 


Forward 


430. 


LPLC 

PROTEIN. - 
BACILLUS 
SUBTILIS. 


3175150 


1 


-CAT 


TCA- 


246 


827 


Reverse 


431. 


Unknown 


3175158 


1 


-CAT 


TTA- 


114 


407 


Reverse 


432. 

. 

1 
1 

< 


PROTEASE 
SYNTHASE 
AND 

SPORULATIO 
N NEGATIVE 
REGULATORY 
PROTEIN PAI 
I. - BACILLUS 
5UBTILIS. 


3175164 


1 


-CAT 


TTA- 


245 


754 


Reverse 



58 



WO 97/43303 



PCT/US97/07950 











Codon 




Position 






SEQID 
NO: 


IDENTITY 


Assembly 
ID 


ORF# 


Start 


Stop 


Start 


Stop 


Direction 


433. 


PUTRESCINE 

TRANSPORT 

ATP-BINDING 

PROTEIN 

POTG. - 

ESCHERICHIA 

COLL 


3175172 


1 


ATG 


TAG 


380 


844 


Forward 


434. 


TRIOSEPHOSP 
HATE 

ISOMERASE 
(EC 5.3.1.1) 
(TIM). - 
BACILLUS 
SUBTILIS. 


3175174 


1 


-CAT 


TTA- 


270 


818 


Reverse 


435. 


Unknown 


3175188 


1 


-CAT 


TTA- 


429 


935 


Reverse 


436. 


PNUC 
PROTEIN. - 
SALMONELLA 
TYPHIMURIU 
M. 


3175192 


1 


ATG 


TAG 


111 


536 


Forward 


437. 


recF protein - 
Streptococcus 
pyogenes 


3175228 


1 


-CAT 


TTA- 


144 


356 


Reverse 


438. 


3- 

DEHYDROQUI 
NATE 
SYNTHASE 
(EC 4.6.1 3). - 

? ^ C WPP 1C WT A 

COLL 


3175240 




-CAT 


TTA- 


410 


1123 


Reverse 


439. 


Unknown 


3175256 




-CAT ' 


TTA- 


77 : 


283 


Reverse 


440. 


Unknown 


3175262 




ATG ' 


TAA 


B4 


377 


'orward 


441. 


Unknown 


3175266 




-CAT ' 


TTA- 


50 


m 


Reverse 


442. 


Unknown 


3175288 




ATG ' 


TGA - 


430 I 


522 


Forward 
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Codon 



Position 



ISEQ ID (IDENTITY 
NO: 



Assembly |ORF# 
ID 



Start Stop Start Stop Direction 



1443. INAD 

DEPENDENT 
METHANOL 
DEHYDROGE 
NASE (EC 
1.1.1.244) 
(MEDH) 
BACILLUS 
METHANOL 
ICUS 



3175298 1 



-CAT TTA- 440 658 iReverse 



1444. (ATP 

SYNTHASE 
ALPHA CHAIN 
(EC 3.6.1.34).- 
ENTEROCOCC 
US FAECALIS 
(STREPTOCOC 
CUS 

FAECALIS) 



3175306 1 



ATG TGA 478 



639 Forward 



1445. IANTHRANILA 
TE SYNTHASE 
COMPONENT 
n (EC 4.1.3.27) 
[(GLUT AMINE 
IAMIDO- 
ITRANSFERAS 
E) .- 

LACTOGOCCU 
S LACTIS 
(SUBSP. 
LACTIS) 
(STREPTOCOC 
CUS LACTIS). 



3175310 1 



ATG TGA 117 



683 Forward 



1446. [Unknown 



3175322 1 



CTGlTAA 3 



[413 [Forward 
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1 






Codon 




Position 






SEQID 
NO: 


IDENTITY 


Assembly 
ED 


ORF# 


Start 


Stop 


Start 


Stop 


Direction 


447. 


ALANYL- 
TRNA 

SYNTHETASE 
(EC 6.1.1.7) 
(ALANINE- 
TRNA 
LIGASE) 
(ALARS). - 
ESCHERICH 
IA COLL 


3175332 


1 


ATG 


TAA 


432 


1250 


Forward 


448. 


dihydrolipoamid 
e dehydrogenase 
(EC 1.8.1.4)- 
Pelobacter 
carbinolicus 


3175356 


1 


-CAT 


TTA- 


486 


1241 


Reverse 


449. 


Unknown 


3175366 


1 


CTG 


TGA 


1 


405 


Forward 


450. 


Unknown 


3175380 


1 


-CAT 


TCA- 


168 


413 


Reverse 


451. 


ASPARTATE- 
AMMONIA 
LIGASE (EC 
6.3.1.1) 

(ASPARAGINE 
SYNTHETASE) 

ESCHERICHIA 
C OLI. 


3175380 


2 


-CAT 


TTA- 


422 


913 


Reverse 


452. 

1 

] 


aldose 1- 
epimerase 
precursor 
(mutarotase) 
(mro) homolog - 
Haemophilus 
influenza e 
[strain Rd 
KW20) 


3175406 




-CAA 


TTA- 


6 


701 


Reverse 
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Codon 




Position 






SEQ ID 

NO: 


IDENTITY 


Assembly 
ID 


ORF# 


Start 


Stop 


Start 


Stop 


Direction 


453. 


DIHYDROORO 
TATE 

DEHYDROGE 
NASE (EC 
1.3.3.1) 

(DfflYDROOR 

OTATE 

OXIDASE) 

(DHODEHASE) 

. -BACILLUS 

SUBTILIS. 


3175442 


1 


ATG 


TAA 


198 


551 


Forward 


454. 


SINGLE- 
STRAND 
BINDING 
PROTEIN 
(SSB) (HELIX- 
DESTABILIZI 
NG PROTEIN). 
- BACILLUS S 
UBTILIS. 


3175444 


1 


-CAT 


TCA- 


12 


242 


Reverse 


455. 

( 


3-OXOACYL- 

{ACYL- 

CARRIER 

PROTEIN] 

REDUCTASE 

(EC 1.1.1.100) 

(3- 

KETOACYL- 
ACYLCAR 
RJER PROTEIN 
REDUCTASE). 

ESCHERICHIA 
COLL 


3175444 


2 


-CAT 


TTA- 


319 


717 


Reverse 
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Codon 




Position 






SEQID 

NO: 


IDENTITY 


Assembly 
ID 


ORF# 


Start 


Stop 


Start 


Stop 


Direction 


456. 


Unknown 


3175446 


1 


-CAT 


TCA- 


432 


971 


Reverse 


457. 


Unknown 


3175450 


1 


ATG 


TAG 


75 


584 


Forward 


458. 


LPLB 

PROTEIN. - 
BACILLUS 
SUBTIUS. 


3175478 


1 


ATG 


TAA 


63 


278 


Forward 


459. 


Unknown 


3175494 


1 


ATG 


TGA 


514 


759 


Forward 


460. 


Unknown 


3175500 


1 


ATG 


TGA 


310 


537 


Forward 


461. 


30S 

RIBOSOMAL 
PROTEIN S9 
(BS10). - 
BACILLUS 
STEAROTHER 
MOPHDLUS. 


3175504 


1 


-CAT 


TTA- 


537 


866 


Reverse 


462. 


AMINO ACID 

PERMEASE 
ROCE.- 
BACILLUS 
SUBTILIS. 


3175512 


1 


ATG 


TAA 


319 


816 


Forward 


463. 


ARGINDME 
HYDROXIMAT 

RESISTANCE 
PROTEIN. - 
BACILLUS 
SUBTILIS. 


3175526 


1 


-CAT 


CTA- 


1 


381 


Reverse 


464. 


Unknown 


3175532 


1 


ATG 


TAA 


309 


644 


Forward 


465. 


PI 15 protein - 
Mycoplasma 
lyorhinis 
(SGC3) 


3175536 




ATG 


TGA 


132 


899 


Forward 


466. 


Unknown 


3175538 




ATG 


TGA 


427 


507 


7 orward 


467. 


Unknown 


3175552 




ATG ' 


TAA 


76 


381 


Forward 


468. | Unknown 


3175556 




-CAT ' 


TTA- 


175 


597 


Reverse 
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Codon 



Position 



SEQID 
NO: 



IDENTITY 



Assembly 
ID 



ORF# 



Start 



Stop Start 



Stop 



Direction 



469. 



Unknown 



3175564 



ATG 



TAA 



164 



Forward 



470. 



471. 



30S 

RIBOSOMAL 
PROTEINS 17 
(BS16). - 
BACILLUS 
SUBTILIS. 



3175566 



-CAT 



TTA- 



388 



648 



Reverse 



Unknown 



3175600 



-CAT 



CTA- 



366 



626 



Reverse 



472. 



UCD 

PROTEIN. - 
HAEMOPHBLU 
S 

INFLUENZAE. 



3175612 



ATG 



TGA 



134 



535 



'orward 



473. 



DNA 
POLYMERASE 
m. ALPHA 
CHAIN (EC 
2.7.7.7). - 
BACILLUS 
SUBTILIS. 



3175632 



TTG 



AA 



506 



-orward 



474. 



475. 



ASPARTATE 

CARBAMOYL 

TRANSFERAS 

E(EC 2.1.3.2) 

(ATCASE). - 

BACE.LUS 

SUBTILIS. 



3175638 



ATG 



GA 



269 



526 



"orward 



SPOOB- 
ASSOCIATED 
GTP-BINDING 
ROTEIN. - 
BACILLUS 
SUBTILIS. 



3175640 



CAT 



CA- 



21 



476 



Reverse 
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Codon 




Position 






SEQID 
NO: 


IDENTITY 


Assembly 
ID 


ORF# 


Start 


Stop 


Start 


Stop 


Direction 


476. 


URACIL 
PERMEASE. - 
BACILLUS 
CALDOLYTIC 
US. 


3175644 


1 


-CAT 


TTA- 


42 


287 


Reverse 


477. 


Unknown 


3175650 


1 


ATG 


TAA 


158 


676 


Forward 


478. 


DIHYDRODIPI 

COLINATE 

SYNTHASE 

(EC 4.2.1.52) 

(DHDPS). - 

BACILLUS 

SUBTILIS. 


3175652 


1 


-CAT 


TTA- 


179 


514 


Reverse 


479. 


Unknown 


3175664 


1 


ATG 


TGA 


596 


979 


Forward 


480. 


DIACYLGLYC 
EROL KINASE 
(EC 2.7.1.107) 
(DAGK) 
(DIGLYCERID 
E KINASE) 
(DGK). - 
STREP 
TOCOCCUS 
MUTANS. 


3175670 


1 


-CAT 


TTA- 


77 


367 


Reverse 


481. 


DnaK protein - 

^actococcus 

actis 


3175688 


1 


-CAA 


TTA- 


170 


1093 


Reverse 


482. 


serine 
transporter' 
(sdaQ homolog 
- Haemophilus 
influenzae 
[strain Rd 
KW20) 


3175698 


1 


ATG 


TGA 


289 


528 


Forward 
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Codon 



Position 



SEQID 
NO: 



IDENTITY 



Assembly 
ID 



ORF# 



Start 



Stop 



Start 



Stop 



Direction 



phage infection 
protein 
precursor - 
Lactococcus 
lactis subsp. 
lactis (strain C2 
) 



3175726 



-CAG 



TTA- 



152 



892 



Reverse 



SCU 19250 

NCBIgi: 

625076- 

Streptomyces 

coelicolor. 



3175744 



ATG 



TGA 



431 



Forward 



ivH protein - 
Escherichia coli 



3175754 



ATG 



TAA 



677 



Forward 



URACIL 
PERMEASE. - 
BACILLUS 
SUBTILIS. 



3175758 



-CAT 



TTA- 



49 



555 



Reverse 



Unknown 



3175770 



ATG 



TAG 



28 



71 



Forward 



Unknown 



3175770 



Unknown 



3175774 



ATG 
CTG 



TAA 
TAA 



41 



311 
408 



Forward 
Forward 



METHIONYL- 
TRNA 

SYNTHETASE 
EC6.lrT.iO) * 

(METHIONINE 
TRNA 

LIGASE) 
METRS). - BA 

CILLUS 

SUBTILIS. 



3175778 



ATG 



TGA 



384 



767 



Forward 
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Codon 




Position 






SEQID 
NO: 


IDENTITY 


Assembly 
ID 


ORF# 


Start 


Stop 


Start 


Stop 


Direction 


491. 


STAGE ffl 
SPORULATIO 
N PROTEIN E. 
- BACILLUS 
SUBTILIS. 


3175786 


1 


-CAT 


TTA- 


8 


202 


Reverse 


492. 


L-FUCOSE 
ISOMERASE 
(EC 5.3.1. O.- 
ESCHERICHIA 
COLL 


3175790 


1 


ATG 


TGA 


328 


534 


Forward 


493. 


L-FUCOSE 
ISOMERASE 
(EC 5.3.1.-).- 
ESCHERICffiA 
COLL 


3175790 


2 


ATG 


TGA 


440 


784 


Forward 


494. 


50S 

RIBOSOMAL 
PROTEIN L15. 
- BACILLUS 
STEAROTHER 
MOPHILUS. 


3175792 


1 


ATG 


TAA 


230 


670 


Forward 


495. 


Unknown 


3175794 


1 


-CAT 


TCA- 


190 


381 


Reverse 


496. 


COME 
OPERON 
PROTEINS. - 
BACILLUS 
SUBTILIS. 


3175800 


1 


-CAT 


TTA- 


182 


649 


Reverse 


497. 


Unknown 


3175804 


1 


CTG 


TAA 


1 


567 


Forward 


498. 


Unknown 


3175806 


1 


-CAT 


TCA- 


283 


657 


Reverse 


499. 


Unknown 


3175812 


1 


-CAT 


TCA- 


78 


596 


Reverse 
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Codon 



Position 



SEQID 
NO: 



IDENTITY 



Assembly |ORF# 
ID 



Start 



Stop 



Start 



Stop Direction 



500. 



FOLYLPOLYG 
LUTAMATE 
SYNTHASE 
(EC 6.3.2.17) 
(FOLYLPOLY 
GAMMA- 
GLUT AMATE 
SYNTHETASE 
) (FPGS). - 
BACILLUS 
SUBTTLIS. 



3175836 1 



ATG 



TAG 



60 



590 Forward 



Unknown 



3175848 1 



CAT 



CA- 



72 



620 Reverse 



PHOSPHOSERI 
NE 

PHOSPHATAS 
E (EC 3.1.3.3) 
(PSP) (O- 

HOSPHOSERI 
NE 

PHOSPHOHYD 
ROLASE) .- 
ESCHERICHIA 
COLL 



3175854 1 



ATG 



AA 



347 



41 Forward 



Unknown 



AEROBIC 
GLYCEROL-3- 
PHOSPHATE 
DEHYDROGE 
NASE (EC 
1.99.5). - 
BACILLUS 
SUBTILIS. 



3175866 1 



3175882 1 



-CAT 



-CAA 



CTA- 



TTA- 



175 



254 



534 [Reverse 



520 Reverse 



Unknown 



3175896 1 



CAT 



TTA- 



64 



636 I Reverse 
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Codon 




Position 






SEQID 
NO: 


IDENTITY 


Assembly 
ID 


ORF# 


Start 


Stop 


Start 


Stop 


Direction 


506. 


Unknown 


3175916 


1 


ATG 


TGA 


177 


437 


Forward 


507. 


DNA- 

DIRECTED 
RNA 

POLYMERASE 
BETA CHAIN 
(EC 2.7.7.6) 
(TRANSCRIPT 
ASE BETA 
CHAIN). - 
BACILLUS 
SUBTILIS. 


3175948 


1 


TTG 


TAG 


2 


187 


Forward 


508. 


Unknown 


3175960 


1 


ATG 


TGA 


534 


758 


Forward 


509. 


DNA 

topoisomerase 
(ATP- 

hydrolyzing) 
(EC 5.99.1.3) 
chain B - 
Staphylococcus 
aure us 


3175984 


1 


-CAT 


TTA- 


125 


604 


Reverse 


510. 


GLUTAMYL 

ENDOPEPTID 

ASE 

PRECURSOR 

(EC 3.4.21.19) 

(GLUTAMATE 

SPECIHC 

ENDOPEPTID 

A SE) (GSE). - 

BACILLUS 

LICHENIFOR 

MIS. 


3175998 


1 


-CAT 


TCA- 


43 


540 


Reverse 
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Codon 




Position 






SEQID 
NO: 


IDENTITY 


Assembly 
ID 


ORF# 


Start 


Stop 


Start 


Stop 


Direction 


511. 


6- 

phosphofructoki 
nase (EC 
2.7.1.11)- 
Lactococcus 
lactis 


3176002 


1 


ATG 


TAA 


662 


829 


Forward 


512. 


Unknown 


3176010 


1 


CTG 


TAA 


1 


546 


Forward 


513. 


ASPARTYL- 
TRNA 

SYNTHETASE 

(EC 6.1.1.12) 

(ASPARTATE- 

TRNA 

LIGASE) 

(ASPRS). - 

THER MUS 

AQUATICUS 

(SUBSP. 

THERMOPHIL 

US). 


3176030 


1 


-CAT 


TCA- 


140 


625 


Reverse 


514. 


Unknown 


3176046 




-CAT 


TTA- 


242 


454 


Reverse 


515. 


GLYCOGEN 
BIOSYNTHESI 
S PROTEIN 
GLGD. - 
BACILLUS 
SUBTILIS. 


3176048 




ATG 


TAG 


411 


704 


Forward 


516. 


S7I704 NCBI 
gi: 560722- 
Legionella 
pneumophila 
Philadelphia- 1. 


3176050 


■ 


ATG 


TGA : 


205 


492 


Forward 


517. 


PFS PROTEIN : 
P46). - 

escherichia 
:oli. 


5176076 




-CAT ' 


rcA- < 


to : 


134 


Reverse 


518. 


Jnknown '. 


$176082 ] 




VTG ' 


TAA i 


18 i 


191 


'orward 
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Codon 




Position 






SEQID 
NO: 


IDENTITY 


Assembly 
ID 


ORF# 


Start 


Stop 


Start 


Stop 


Direction 


519. 


SGHRDT NCBI 
gi: 510450- 
Streptomyces 
griseus. 


3176086 


1 


-CAG 


TCA- 


23 


535 


Reverse 


520. 


Unknown 


3176108 


1 


-CAT 


TTA- 


4 


501 


Reverse 


521. 


trsB protein - 

Yersinia 

enterocolitica 


3176112 


1 


ATG 


TGA 


127 


408 


Forward 


522. 


Unknown 


3176116 


1 


ATG 


TAA 


198 


515 


Forward 


523. 


Unknown 


3176120 


1 


-CAA 


TTA- 


295 


729 


Reverse 


524. 


SIGNAL 
PEPTIDASE I 
(EC 3.4.21.89) 
(SPASE I) 
(LEADER 
PEPTIDASE I). 
- BACILLUS 
CA 

LDOLYTICUS. 


3176124 


1 


-CAT 


TCA- 


298 


570 


Reverse 


525. 


ligoendopeptidas 
cF- 

Lactococcus 
actis 


3176132 


1 


-CAT 


TCA- 


51 


380 


Reverse 


526. 


Unknown 


3176134 




-CAT 


TTA- 


299 


436 


Reverse 


527. 


Unknown 


3176136 




CTG 


TAA 




273 


"orward 


528. 


Unknown 


3176152 




-CAT 


TCA- 


260 


688 


Reverse 


529. 


Unknown 


3176158 




ATG 


TGA 


273 


386 


Forward 




murine- 
nucleoside 
phosphorylase 
fEC 2.4.2.1)- 
Bacillus subtilis 
(fragment) 


3176172 




-CAT 


TTA- 


89 


331 


Reverse 
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Codon 




Position 






SEQID 
NO: 


IDENTITY 


Assembly 
ID 


ORF# 


Start 


Stop 


Start 


Stop 


Direction 


531. 


Unknown 


3176178 


1 


-CAA 


CTA- 


101 


532 


Reverse 


532. 


PTS SYSTEM. 
LACTOSE- 
SPECIFIC IIBC 
COMPONENT 
(EDBC-LAC) 
(LACTOSE- 
PERMEASE 
IIBC 

COMPONENT) 
(PHOSPHOTR 
ANSFERASE 
ENZYME II, 
BC 

COMPONENT) 
(EC 2.7. 1.69) 
(EII-L AC). - 
LACTOCOCCU 
SLACTIS 
(SUBSP. 
LACTIS) 
(STREPTOCOC 
CUS LACTIS). 


3176182 


1 


-CAT 


TCA- 


43 


246 


Reverse 


533. 


Unknown 


3176184 




ATG 


TGA 


61 


336 


Forward 


Jj'r. 


uwcnown 


3176188 




ATG 


TAG 


42 


248 


Forward 


535. 


Unknown 


3176208 




ATG ' 


I*AA 


41 


?.38 


Forward 


536. 


Unknown 


3176216 




-CAT ' 


ITA- 


135 


335 


Reverse 


537. 


Unknown 


3176248 




-CAT " 


ITA- 


141 


386 


Reverse 


538. 


Unknown 


3176260 




ATG * 


rGA 


158 


343 ] 


Forward 


539. 1 


Unknown 


3176272 




-CAT ' 


ITA- 


331 


t65 1 


Reverse 
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Codon 




Position 






SEQID 
NO: 


IDENTITY 


Assembly 
ID 


ORF# 


Start 


Stop 


Start 


Stop 


Direction 


540. 


REGULATORY 

PROTEIN 

MTRR. - 

NEISSERIA 

GONORRHOE 

AE. 


3176280 


I 


ATG 


TAA 


287 


502 


Forward 


541. 


Unknown 


3176288 


1 


-CAA 


TCA- 


5 


520 


Reverse 


542. 


Unknown 


3176304 


1 


-CAT 


TCA- 


41 


433 


Reverse 


543. 


Unknown 


3176330 


1 


CTG 


TGA 


2 


298 


Forward 


544. 


Unknown 


3176330 


2 


ATG 


TAG 


271 


381 


Forward 


545. 


6-PHOSPHO- 
BETA- 

GLUCOSIDAS 
E(EC 3.2.1.86). 
- 

ESCHERICHIA 
COLL 


3176338 


1 


-CAT 


TCA- 


5 


130 


Reverse 


546. 


Unknown 


3176394 


1 


-CAT 


TTA- 


17 


223 


Reverse 


547. 


Unknown 


3176398 


1 


-CAT 


TTA- 


41 


310 


Reverse 


548. 


Unknown 


3176420 


1 


-CAA 


TTA- 


382 


678 


Reverse 


549. 


jossible acid 
phosphatase 


3176446 




ATG 


TGA 


113 


475 


forward 


550. 


Unknown 


3176480 




ATG 


TAG 


151 


417 


-orward 


551. 


Unknown 


3176542 




ATG 


TAA 


129 


446 


'orward 


552. 


Unknown 


3176560 




-CAT 


TTA- 


102 


374 


Reverse 
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Table 2 shows the correlation between the SEQ ID NO of each DNA sequence of 
the invention with the SEQ ID NO(S) of polypeptide or polypeptides that its open reading 
frame(s) encodes. For example, the DNA of SEQ ID NO: 1 encodes one polypeptide, that 
of SEQ ID NO:24. Whereas, the DNA of SEQ ID NO:2 encodes two polypeptides, the 
polypeptides of SEQ ID NO:286 and SEQ ID N0.287. 



TABLE 2 



DNA 


Protein (open reading frame) 




1 


263 




2 


264 




3 


265 


. . 


4 


266 




5 


267 




6 


268 




7 


269 


. 


8 


270 




9 


271 




10 


272 




11 


273 




12 


274 




13 


275 




14 


276 




15 


277 




16 


278 




17 


279 




18 


280 




19 


281 






262 




21 


283 




22 


284 




23 


285 




24 


286.287 




25 


288 




26 


289 




27 


290 




28 


291 




29 


292 




30 


293 




31 


294 




32 


295 




33 


296 
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DNA 


Protein (open reading frame) 




34 


297 




35 


298 




36 


299 




37 


300 




38 


301 




39 


302 




40 


303 




41 


304 




42 


305.306 




43 


307 




44 


308.309 




45 


310 




46 


311.312 




47 


313 




48 


314 




49 


315316 




50 


317.318 




51 


319.320 




52 


321.322 




53 


323 




54 


324 




55 


325 




56 


326.327 




57 


328 




58 


329,330 




59 


331 




60 


332 




61 


333.334 





62 


335,336 




63 


337 


— 


64 


338 




65 


339- 




66 


340 




67 


341 




68 


342 




69 


343 




70 


344 




71 


345 




72 


346 




73 


347 




74 


348,349 
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UP* A 


Protein (open reading frame) 


/0 


33U 


/O 


331 


/ / 


1C1 

332 


7fi 
/o 


J3.3»334 


"70 


333,330 


©a 


33/ 


Ol 

81 


ICQ 

338 


82 


ICO 1£A 

359,360 


Ol 

83 


361 


84 


362 


83 


363 


86 


364,363 


QT 
0/ 


366 


go 

88 


367 


89 


368 


90 
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91 


370 


92 
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93 
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94 


373 


AC 

95 
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96 
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97 
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98 
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1 Aft 
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1 AQ 
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in 
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113 


394 


114 


395 


115 


396 
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DNA 


Protein (open reading frame) 


116 


397 


117 


398 


118 


399 


119 


400 


120 


401 


121 


402,403 


122 


404 


123 


405 


124 


406 


125 


407 


126 


408 


127 


409 


128 


410,411 


129 


412 


130 


413 


131 


414 


132 


415 


133 


416 


134 


417,418 


135 


419 


136 


420 


137 


421,422 


138 


423 


139 


424 


140 


425 


141 


426 


142 


427 


143 


428 


144 


429 


145 


430 


146 


431 


147 


432 


148 


433 


149 


434 


150 


435 


151 


436 


152 


437 


153 


438 


154 


439 


155 


440 


156 


441 
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Idna 


Protein (open reading frame) 


157 


442 " 


1 158 


443 " 


159 


444 


160 


445 


161 


446 ' 


162 


447 ~ " 


163 


448 ~ " 


164 


449 — ' ■ 


165 


450,451 ~ 1 " ■ 


166 


452 ' 


167 


~453 " ' " 


168 


454.455 ~~ ~ — 


169 


456 " 


170 


~457 ' ~~~ * — - 


171 


~458 ' 


172 


~459 ~~ — 


173 


~460 " 


174 


"461 ' 


1 175 


Afri ~ — 


176 


463 


177 


"464 ' 


178 


465 


179 


466 


j 180 


467 



186 
187 



190 

1191 



193 
fl94~ 



181 468 

1 182 469 

1 183 470 
184 471 

1 185 472 
473 
474 

1 188 475 

1 189 476 
477 
478 



192 479 



480 
481 
195 482 
1 196 483 
197 484 
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DNA 


Protein (open reading frame) 


198 
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199 


486 




487,488 
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492,493 
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212 
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508 
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509 


221 
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517 
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518 


230 
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231 
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232 


521 


233 
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234 


523 
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524 


236 


525 


237 


526 


238 


527 
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DNA 


Protein (open reading frame) 


239 
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241 
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243 
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245 
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254 


543.544 


255 


545 
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258 
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Examples 

In order to facilitate understanding of the following example certain frequently 
occurring methods and/or terms will be described. 

"Plasmids" are designated by a lower case p preceded and/or followed by capital 
5 letters and/or numbers. The starting plasmids herein are either commercially available, 
publicly available on an unrestricted basis, or can be constructed from available plasmids in 
accord with published procedures. In addition, equivalent plasmids to those described are 
known in the art and will be apparent to the ordinarily skilled artisan. 

"Digestion" of DNA refers to catalytic cleavage of the DNA with a restriction 

10 enzyme that acts only at certain sequences in the DNA. The various restriction enzymes 
used herein are commercially available and their reaction conditions, cofactors and other 
requirements were used as would be known to the ordinarily skilled artisan. For analytical 
purposes, typically 1 ng of plasmid or DNA fragment is used with about 2 units of enzyme 
in about 20 pi of buffer solution. For the purpose of isolating DNA fragments for plasmid 

15 construction, typically 5 to 50 pg of DNA are digested with 20 to 250 units of enzyme in a 
larger volume. Appropriate buffers and substrate amounts for particular restriction enzymes 
are specified by the manufacturer. Incubation times of about 1 hour at 37° C are ordinarily 
used, but may vary in accordance with the supplier's instructions. After digestion the 
reaction is electrophoresed directly on a polyacrylamide gel to isolate the desired fragment. 

20 Size separation of the cleaved fragments is performed using 8 percent 

polyacrylamide gel described by Goeddel, D. etai, (1980) Nucleic Acids Res., 8:4057. 

"Oligonucleotides" refers to either a single stranded polydeoxynucleotide or two 
complementary polydeoxynucleotide strands which may be chemically synthesized. Such 
synthetic oligonucleotides have no 5* phosphate and thus will not ligate to another 

25 oligonucleotide without adding a phosphate with an ATP in the presence of a kinase. A 
synthetic oligonucleotide will ligate to a fragment that has not been dephosphorylated. 

"Ligation" refers to the process of forming phosphodiester bonds between two 
double stranded nucleic acid fragments (Maniatis. T., et at., supra. T p. 146). Unless 
otherwise provided, ligation may be accomplished using known buffers and conditions with 

30 10 units to T4 DNA ligase ("Hgase") per 0.5 pg of approximately equimolar amounts of the 
DNA fragments to be Hgated. 
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Example 1 

isolation of DNA coding for a virulence gene in Streptococcus pneumoniae 

As mentioned above each of the DNAs disclosed herein by virtue of the fact that it 
includes an intact open reading frame is useful to a greater or lesser extent as a screen for 
5 identifying antimicrobial compounds. A useful approach for selecting the preferred DNA 
sequences for screen development is evaluation by insertion-duplication mutagenesis. This 
system disclosed by Morrison et al. f J. Bacteriol. 159:870 (1984), is applied as follows. 

Briefly, random fragments of Streptococcus pneumoniae, strain 0100993 DNA are 
generated enzymatically (by restriction endonuclease digestion) or physically (by sonication 

1 0 based shearing) followed by gel fractionation and end repair employing T4 DNA 

polymerase. It is preferred that the DNA fragments so produced are in the range of 200-400 
base pairs, a size sufficient to ensure homologous recombination and to insure a 
representative library in KcolL The fragments are then inserted into appropriately tagged 
plasmids as described in Hensel tfaL Science 269: 400-403(1995). Although a number of 

15 plasmids can be used for this purpose, a particularly useful plasmid is pJDC9 described by 
Pearce et al., Mol. Microbiol- 9:1037 (1993) which carries the erm gene facilitating 
erythromycin selection in either E. coii or 5. pneumoniae previously modified by 
incorporation of DNA sequence tags into one of the polylinker cloning sites. The tagged 
plasmids are introduced into the appropriate 5. pneumoniae strain selected, inter alia, on the 

20 basis of serotype and virulence in a murine model of pneumococcal pneumonia. 

It is appreciated that a seventeen amino acid competence factor exists (Havastein et 
aL, EES. Natl. Acad- SsL USA 92: 1 1 14(M4 (1995)) and may be usefully employed in this 
protocol to increase the transformation frequencies. A proportion of transformants are 
analysed to verify homologous integration and as a check on stability. Unwanted levels of 

25 reversion are minimized because the duplicated regions will be short (200-400 bp), however 
if significant reversion rates are encountered they may be modulated by maintaining 
antibiotic selection during the growth of the transformants in culture and/or during growth 
in the animal. 

The 5. pneumoniae transformants are pooled for inoculation into mice, eg., Swiss 
30 and/or C57B 1/6. Preliminary experiments are conducted to establish the optimum 
complexity of the pools and level of inoculum. A particularly useful model has been 
described by Veberet al. (J. Antimicrohiol. Chemother 37*17 (1993) in which 10 5 cfu 
inocula sizes are introduced by mouth to the trachea. Strain differences are observed with 
respect to onset of disease e.g.,3-4 days for Swiss mice and 8- 10 days for C57B 1/6. 
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Infection yields in the lungs approach 10 8 cfu/lung. IP administration is also possible when 
genes mediating blood stream infection are evaluated. Following optimization of 
parameters of the infection model, the mutant bank normally comprising several thousand 
strains is subjected to the virulence test Mutants with attenuated virulence are identified by 
5 hybridization analysis using the labelled tags from the "input" and "recovered" pools as 
probes as described in Hensel £1 al. ( Science 269: 400-403(1 995). 5. pneumoniae DNA is 
colony blotted or dot blotted, DNA flanking the integrated plasmid is cloned by plasmid 
rescue in £. coli (Morrison et al. v J. Bacteriol . 159:870 (1984)) and sequenced. Following 
sequencing, the DNA is compared to the nucleotide sequences given herein and the 

10 appropriate ORF is identified and function confirmed for example by knock-out studies. 

Expression vectors providing the selected protein are prepared and the protein is configured 
in an appropriate screen for the identification of anti-microbial agents. Alternatively, 
genomic DNA libraries are probed with restriction fragments flanking the integrated 
plasmid to isolate full-length cloned virulence genes whose function can be confirmed by 

15 "knock-out" studies or other methods, which are then expressed and incorporated into a 
screen as described above. 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION 

(i) APPLICANT: Black, Michael 
Hodgson, John 
Knowles, David 
Nicholas, Richard 
Stodola, Robert 

(ii) TITLE OF THE INVENTION: Novel Compounds 

(iii) NUMBER OF SEQUENCES: 552 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: SmithKline Beecham Corporation 

(B) STREET: 709 Swedeland Road 

(C) CITY: King of Prussia 

(D) STATE: PA 

(E) COUNTRY: USA 

(F) ZIP: 19406-0939 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette 

(B) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM: DOS 

(D) SOFTWARE : FastSEQ for Windows Version 2.0 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 14-MAY-1997 

(C) CLASSIFICATION: 

(Vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 60/017670 

(B) FILING DATE: 14-MAY-1996 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: King, William T. 

(B) REGISTRATION NUMBER: 30,954 
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(C) REFERENCE /DOCKET NUMBER: P50475 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 610-270-5015 

(B) TELEFAX: 610-270-5090 

(C) TELEX: 

(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 805 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION : SEQ ID NO:l: 



CTANACAAAC 


CAGATAAGGA 


CCAAGAGCTT 


AAAGCTGAAA 


TTCAATCCAT 


TTTTATCGAA 


60 


CACAAGGGAA 


ATTATGCTTA 


TCGCTGTATT 


CATTTAGAAC 


TAAGAAATTC 


GTGCTTATCT 


120 


GGTAAATCAT 


AAAAGAGTTC 


AAGGCTTGAT 


GAAAGTACTC 


AATTTACAAG 


CTAAAATGCG 


180 


ACAGAAACGA 


AAATAAAGGA 


GACGTTGGTA 


AGAAGGCAGA 


GAATCTCATT 


CAAGGCCAAT 


240 


TTGAAGGCTC 


TAAAACAATG 


GAAAAGTGCT 


ACACAGATGT 


GACAGAATTT 


GCCATTCCAG 


300 


CAAGTACTCA 


AAAGCTTTAC 


TTATCACCAG 


TTTTAGATGG 


CTTTAATAGC 


GAAATTATCG 


360 


CCTATAATCT 


TTCAACTTCA 


CCCAACTTAG 


AACAAGTACA 


AACAATGTTG 


GAACAGGCAT 


420 


TCACAGAGAA 


GCACTACGAG 


AATACGATTC 


TCCATAGTGA 


CCAAGGCTGG 


CAATACCAAC 


480 


ACGATTCTTA 


TCATCGGTTC 


CTAGAGAGTA 


AGGGAATTCA 


AGCATCCATG 


TCACGCAAGG 


540 


GAAACAGCCA 


AGATAACGGT 


ATGATGGAAT 


CTTTCTTTGG 


GATTCTGAAA 


TCGGAAATGT 


600 


TTTACGGTTA 


TGAGAAGTCG 


TTTCAGTCGC 


TTAAGCAATT 


GGAACAAGCC 


ATTATAGACT 


660 


ATATTGATTA 


CTACAACAAT 


AAGAGAATTA 


AGGTAAAACT 


AAAAGGACTT 


AGCCCTGTGC 


720 


AATACAGAAC 


TAAATCCTTC 


GGATAAATTA 


ATTGTCTAAC 


TTTTGGGGGG 


CAGTACATTT 


780 


TTGGTATATA 


TAAAATTTGT 


AGGAG 








805 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 764 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

TTCGATCTGG TTACCGTTGC TGATAGTAAT GAAATACCTA CTAACAATTC ACCTAAAGCA 
TCAATTTGAA AAGTTTCTAC TAGAATATGA AATAAAGGAT TAATGGGTCT TGTCAGGTAT 
TTATCAAATT CGCCCTTACG AACCAGGCGT TGCCCTAGAG CCCATAAATT ATCAAAAAAG 
AGATGATCAA TTCCCTTGGG AATTAGGGAA AAGCCATAGA TAAAGGCAAT CTGCTGAAAA 
CTCCACCCAT CAAGTGATGG GATGTTTTGA AATATAATAT TAAGAAATAA CAGGTTTAAC 300 
CCTTGCGTTA AAAAACGCCT AAAACGCCAA CAACGAAGTC AACCTTATAC TCCACAATTT 360 
GCTTGAGGTA TTGTCTGATA AAAATCAAGT GCATACGTTG ATATTTCTTC ATACTAACCT 
CCTTGAATAG TAATGTGGGA CTGTACACGT TTCCAAATAA TTTGGGACAA AGCAATCATA 
ACAAGCAACC AAAATAATTG TAGCAAGACA GCCTGAAGAA TCTGACTGGC ATCGTATTTT 
CCAACAATGA TCATAACTGG AGTATAAATC AAGGATGAAA AAGGCAAAAG GAGAGGATAT 
CTGAGACAAC CTTTGGAGAG GATGCCAAGG GAATCAAACT CCCCGACATT AAAGCCACTA 660 
TGGAAGTCTT AAGTAGGTTG GAACCCCAAA GATTTTTAAA CACAAAGGCT GTAAATCCCA 720 
AGCAAATATT AAAGAAAAAG TTAATCAGAT AGGCGAGCGT TAAG 764 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1133 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

CTAAGACAGG AGTCCGAATG TCAATACGGA TTTCAGAACG TTCATGATTG ATCATGAGAC 60 

CTGCGACATT AAAGGATAGG TGACCAGAAG GTTCATCTGC TATATCACCA AAGATTTGTC 120 

TTCCTGTGCC GTCTTGACCT GCTTGTGTTG CAAGAAAACT GAGAGCAGGG TGTTCTTGGA 180 

GAGGAGCAAG AAfGGTAGCT AGTCGGATGA CAGCATTGAT ACCTTGACTA GCATCCTTAG 240 

CATGCTTTGG CACTCCGAGA ACCGTTACGG TTTGTTCAGT GGATTGGTAA TCATAACCAG 300 

CTTCTTTGAG ACCGTTACAA ACCTGTTCAT AGAGGGGACC TTGGTAGTTG GCCTTGTCTG 360 

GTACAACGTT AAAGGCGCCT CCTACTTCAA GCTCTAGTTG ATCCGATCCA GAGCCATAAA 420 

GTTTGACCTG TAGAAGCCCT TTTTCAGCAT AGGTCAGAAG AAAAGATGAG TCAGGTGCNA 480 

AGCCCATACT GGCCTGTTCT TCGATGGTAT TGTAGCGTGC CATGCAGCGC CAGAGGGTTT 540 

CCTCATCGGT ACCAAAGATA AAGCGTACGC GCTTTTTGAA CTGAATACCT TGGTCCAGCA 600 

AGCTTTTTAC TGCATAGAGA GCTGCGAGCG GAGGGCCTTT ATCATCTTGG ACACCACGTC 660 

CGATACCCAG CCGTCTTTGA TAGTTGCTTC AAATGGCGGT GTCTGCCAAT CTGCTTCATC 720 

ACCTGATGGA ACAACATCCA AATGACAGAG AATGGCCAGA AGCTCTGCTC CCTGACCGAT 780 

TTCTGCATAT CCGTAATAAC CTTTAGGGTC AAGATAGGTA GTGAAACCTA TGTCTCGACA 840 
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AATCTCTAAA GTTTTTTCTA GGACATCTTG GATTGCTTGT CCAAAAGGTG TTCCATTTTC 900 

TCCTTCATTG AGTACTGAAG GATAGGAAAT CAAGGTTTTT AATGATATAA GAAATTCATC 960 

TTTAACTTGA TTTGTTATAT CTATTTTCAT GGAATCACCT CACTCACCAT TTTATAGGAA 1020 

TGGAAGGAAG GTTCCAAGGA GAAGAAGGGC GATGGTCACT ACAATAATAG CGACTACGAG 1080 

TTTGCCCATG AATTTCCACC AAGTACCAAT GTTGATACGT CCAAGTGCAA GAG u 33 

(2) INFORMATION FOR SEQ ID NO: 4: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1071 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 



TGATAAGTAT GAGACTCGAA GCATTGAACA TTAAACTTCC AGATTTCAGA TATATGGTGG 60 

ACCTTCAAAT CATCATACTT ATTTTGGAGA TAAAAAACGA TATTTACAAG CAATTAAGAA 120 

TTGAGCAAGA GCTATTTACA GTAGCTTAAT GAAAGGAAAG TATGTCAATT ACATCATTTG 180 

TAAAAAGAAT TCAAGATATC ACTCGAAACG ATGCTGGTGT TAATGGTGAT GCTCAACGTA 240 

TTGAGCAAAT GTCTTGGTTA TTATTCTTAA AAATTTATGA TAGCCGTGAA ATGGTTGGGG 300 

AATTAGGAAG AAGGACGGAG TATGGAGTCA ATTATTCCCA GGAGGAATTA AAATGGCGGA 360 

AATTGGGGCT CATGCTCAAA ATGGGGAACG GGTATTGACA GGCGATGAAT TAATTGATTT 420 

TGTCAATAAC AAGTTATTCA AAGAGTTGAA AGGAGCTTGA AATAACTTCA AATATGCCTA 480 

TTCGAAAAAC GATTATTAAA TCAGCTTTTG AAGATGCGAA CAACTATATG AAAAATGGCG 540 

TCTTGTTACG CCAAGTCATC AATGTTATTG ATGAAGTTGA TTTCAATAGC CCTGAAGATC 600 

GTCATTCGTT TAATGATATT TACGAAAAAA TTCTTAAAGA TATTCAAAAT GCTGGGAACT 660 

CAGGAGAATT TTATACGCCA CGTGCAGCGA CTGATTTTAT TGCCGAAGTT CTTGACCCAA 720 

AACTTGGAGA ATCAATGGCA GACCTTGCTT GCGGAACAGG AGGCTTCTTG ACTTCGACTC 780 

TGAACCGTTT AAGTAGTCAA CGTAAAACTA GTGAAGATAC CAAAAAATAT AATACAGCTG 840 

TTTTTGGTAT TGAAAAGAAA GCATTTCCTC ATCTTTTAGC AGTTACAAAT CTGTTTCTTC 900 

ACGAAATTGA TGACCCTAAA ATTGTTCATG GAAATACTTT GGAGAAAAAT GTTCGTGAAT 960 

ATACGGATGA TGAAAAATTT GACATTATTA TGATGAATCC ACCTTTTGGA GGGTCAGAAT 1020 

TAGAAACAAT AAAAAATAAC TTTCCAGCAG AATTACGGAG TTCTGAAACA G 1071 

(2) INFORMATION FOR SEQ ID NO: 5: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1450 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

CTAGGATAGA CCGCTTTTTA GCATTTATCT AAGCATTCCA GAGTACATGT ATCTTGCATG 
TGCTCTTTCT TTTGGGGTTG AAACGATAGG AGAAGGAAAT GTTAGAATTG CTTAAATCAA 
TCGATGCTTT TGCTTGGGGA CCGCCCCTCT TGATTTATTG GTCGGAACAG GGATTTACCT 
AACCATGCGG CTAGGACTCT TGCAGGTTTT GCGTCTGCCC AAGGCCTTTC AGCTTATTTT 
TATCCAGGAT AAGGGACATG GTGATGTATC CAGTTTTACA GCTTCTTGTG TAACAGGCCC 
TTGGGCATTC AAACTGGTTG GGAAACAGGG AAATATCCAT TAGGGAGTTG GCGACGGCTA 
TCAAGGTTGG TGGACCAGGA GCTCTATTTT GGATGTGGAT GGCGGCTTTC TTTGGAATGG 
CTACCAAGTA TGCGGAAGGA CTCTTGGCCA TCAAATACCG CACCAAGGAC GACCATGGTC 
CAGTAGCGGG AGGTCCCATG CATTATATCC TTCTAGGGAT GGGAGAAAAG TGGCGACCAC 
TTGCTGTTTT GTTTGCAGTA GCAGGAGTAT TGGTTGCTCT CTTGGGAATC GGAACCTTCA 
CCCAAGTCAA CTCGATTACA GAATCTATCC AAAATACAAC GACGATTTCG CCAGCCATCA 
CAGCTCTCGT CTTGTCTGTC TTTGTAGCGA TTGCAGTCTT TGGTGGACTC AAGTCTATTT 
CTAAGGTTTC AACTACTGTT GTTCCTTTTA TGGCCATCAT TTATATCTTA GGAACTCTTA 
CAGTTATTTT CTTTAATATC GGAAAAATCC CTGGCACAAT CGCTTTAGTC TTTACCTCAG 
CTTTTAGTCC CCTTGCTGCG GTAGGTGGAT TTGCTGGTGC TAGCGTTCGG ATGGCTATTC 
AAAATGGTGT GGCGCGTGGT GTGTTTTCAA ACGAATCTGG TCTGGGTTCT GCTCCTATTG 
CAGCTGCAGC TGCCAAGACA AATGAACCAG TAGAGCAAGG TTTGATTTCC ATGACAGGAA 
CCTTTATTGA TACCCTCATC ATTTGTACTC TAACTGGTTT GACCATCTTG GTAACTGGAG 
TTTGGAGTGG TGACTTGAAT GGGGTTGCCT TGACTCAGTC AGCTTTCTCA ACAGTCTTTT 
CACACTTTGG GCCTGCCCTC TTGACCATCT TCCTTGTGCT TTTTGCCTTT ACAACGATTC 
TAGGTTGGAA CTATTACGGA AGAACGCTGT TTTGAGTTCC TCTTTGGGGT TCGCTTTATC 
TGGCTCTACC GNGTGGTTTT TGTGCTCATG GTCTTGTTAG GAGGATTTAT CGAGTTGGAN 
ATGGTCTGGA TTATCGCAGA TATCGTTAAC GCCTTGATGG CTCTGCCAAA NTNGATTGCC 
CTCTTGGTCT TGTCGCAAGT CGTTATTGCT GAAACTAAAA AGTATTTTGA CAAATAATGG 
AATCACACAG 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 885 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
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CTCATCATCT GTCAAAAAGC GTTTCTTAGC AGTCGTGATA TCCATAAAAT AATCTAATAT 60 

CACGATTTCC TCATCCGCAA AGAAAGGAAG GCTGACCAAC TCCAGTGCCA CATCCTTGTA 120 

AACTACTTCT TGCATATCAA AGTAGGCAAA GTTGAGGTCA GCAGAATCAT ACCCAATCTG 180 

TTTCAACACT TGACTCTTCA TCACTTCAAA CTGACCCTGA TCTGTCCCTG TAAATAGGCG 240 

CAGGCTCGGT AAATTCGATA AAGTCAACTT CTGACTTTCT TCAATGGCTA GCATCGTCTC 300 

TCCTTTCTTC AGATTTTTCG ATTTAATTTA GTCAATATAG CGCAATTTCC CACGGAAATC 360 

TTCTAAGCTC TCGTAGCCTT TTTCCACCAT GATTGCTTTC AGTTCATTGG TAAAGCGGTC 420 

AAAAGCACTG ACGCCTTCTT TGTGAAGGGT CGTTCCCACC TGCACCATAC TTGCTCCACA 480 

GAGGATGTGT TCAAAGGCAT CTCGACCAGT CAGAACGCCA CCTGTTCCGA TAATTTGGAT 540 

TTGAGGATTT AAACGTTGAT AAAAGGCGTG AACATTGGCT AGAGCAGTCG GTTTGATGTA 600 

TTATCCACCA ATTCCACCAA AACCATTCTT AGGCCGAATA ACGACAGATT CGTCTTCTAT 660 

ATAGAGGCCG TTTCCGATAG AGTTAACGCA GTTGACAAAC TTGAGCGGAT ATTTGTTGAA 720 

AATAGCTGCC GCTTGATCAA AGTGAACAAT ATCAAAATAA GGTGGCAATT TAATTCCAAG 780 

AGGTTTGGTG AAGTAAGCAA ACACTTCTGC CAAAATCCGG TCTGTTGTCT CAAAATCATA 840 

GGC AATCTGA GGTTTACCTG GAACATTTGG ACAGGAAAGA TTTAG 885 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 774 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

CTCAAACAAG TGGTGAGATT CCAGAAAAAA AAACGCGAAG AAATTGCGTC AGATTATCAC 60 

AACTATCTTT GAACAAAAGA TGAATAAGCA TAGTATTAAG TAGAGAATGA AAAAATATTT 120 

TATTGGCGGT TTGGGAAGCA ATGCCTATCA TAGCAAGGAT TTTCTTCAAG AACTAGATTC 180 

GCAGGTCTAT TTT£TAAATC CATATGAAAA GCATCTTCGA GATGAAACAG AATTGAAATC 240 

ATGGTTTAAA AATGAGATTG TAGAGGAAGA ATCTATCTGT CTGATAGGTC ATTCTATTGG 300 

AGGCGATTTA GCTCGTTATT TCGCATCGGA ATTTGAAGAA GTAAAGAAAC TGATTCTTTT 360 

GGATGGTGGC TATCTAGATT TAGATAAGAT TTTACCTTTG GATACAGAGT TAGAGGAAAC 420 

TAAAAATTAT ATCAAATCTC AAATTATTTC GGACTTAGAT GTTCTTACTT CTAAAGAAAA 480 

ATCTGAAGCA AAGCATTGGT CAGAAAATAT GGAGAAAGCT GTAAGACAGT CCTATCACTG 540 

GAATGTTGAG TATAATAGAT ATGAGTTGGC TATAAATTAT GAAAATATAG AAGCGATACT 600 

CCGCCTACGG AGAAAAATAC AAGCTTTTAA GAGAGAAGTG GGAGATACCT TGTTTATTAG 660 

TCCTCGCTAT CCTAATGAAG CTACATGGAG AGAGGAAGCC CTAAAAGAAT TGCCAGACTA 720 

TTTTGATACT ATTTTTTAGA AACTTTGGCC ATGAGCCTTT ATACTCAGCA CCTA 774 
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(2) INFORMATION FOR SEQ ID NO: 8: 

(a) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1196 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 



. GTAGAGGTCA 


TTGAAGAAGC 


60 


: CAACTATAAC 


ACACCTGCAC 


120 


GGTTGAACTT 


TTGCAAGAAG 


180 


CTTTCACACC 


GCTCTCCTTG 


240 


AAGTTTTTCA 


GATTTTACTT 


300 


AGAGGACATT 


GCTCAGCTCT 


360 


TATTGGGGTC 


ATGCAAGAAG 


420 


TCTTGTCAGG 


TTTTGTTAAA 


480 


AAGCGAGTTT 


AGTAGCACTT 


540 


AAAAAATGAA 


ACTAGAACAT 


600 


TTGCCATCGC 


CCACAAGTTT 


660 


CAATCTCAGA 


AGAATTGCTC 


720 


CAGGAGATGT 


ATCAGATTTT 


780 


TGGGTTCAGT 


AGATGTTTTG 


840 


AGATGACAGA 


AGCAGATTTT 


900 


TGACACAATC 


AGTCTTGAAA 


960 


CTAGTGTTGT 


TGGTTTGATG 


1020 


GCTTGATTGG 


CTTTACCAAG 


1080 


ATGTGATTGC 


TCCAGGAAAT 


1140 


AAGGAAGCTA 


CACTAG 


1196 



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 861 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
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<xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 


9: 






CTGGGAAAAC 


CACCCTTTTC 


AACCTTTTGA 


CCGGTGTTTA 


TGAACCAAGC 


GAGGGAACAG 


60 


TAACCCTAGA 


TGGTCACCTT 


TTGAATGGGA 


AATCACCTTA 


TAAGATTGCC 


TCTTTGGGAC 


120 


TTGGACGTAC 


TTTCCAAAAT 


ATCCGTCTTT 


TTAAAGATTT 


AACAGTTTTA 


GACAATGTTT 


180 


TGATTGCTTT 


TGGAAACCAT 


CACAAACAGC 


ATGTTTTTAC 


TAGTTTCTTA 


CGCTTACCAG 


240 


CTTTTTACAA 


GAGTGAAAAA 


GAATTAAAGG 


CTAAAGCTTT 


GGAATTGTTG 


AAAATCTTTG 


300 


ATTTAGATGG 


TGATGCAGAG 


ACTCTTGCTA 


AAAATCTTTC 


CTACGGACAA 


CAACGTCGTT 




TGGAAATTGT 


TCGTGCCCTT 


GCTACGGAAC 


CCTAAAATTC 


TCTTCTTAGA 




A*>n 


GCAGGTATGA 


ACCCACAGGA 


AACAGCCGAA 


TTGACTGAGT 


TAATTCGTCG 


TATCAAAGAT 


AftO 


GAGTTTAAGA 


TTACAATCAT 


GTTGATTGAA 


CACGATATGA 


ATCTGGTCAT 


GGAAGTAACA 


34 U 


GAACGTATCT 


ACGTACTTGA 


ATATGGCCGT 


TTAATCGCTC 


AAGGAACTCC 


AGACGAAATT 


OlIU 


AAGACCAATA 


AACGCGTTAT 


CGAAGCTTAT 


CTAGGAGGTG 


AAGCCTAATG 


TCTATGTTAA 


660 


AAGTTGAAAA 


TCTTTCTGTG 


CATTACGGTA 


TGATCCAAGC 


AGTTCGTGAT 


GTAAGCTTTG 


720 


AAGTTAATGA 


AGGAGAAGTT 


GTTTCCCTTA 


TCGGTGCCAA CGGTGCAGGT AAGACAACTA 


780 


TTCTTCGCAC 


CTTGTCAGGT 


TTGGTTCGAC 


CAAGTTCAGG 


AAAGATTGAA 


TTTTTAGGTC 


840 


AAGAAATCCA 


AAAAATGCCA 


G 








861 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 582 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 



CTATGAGAGA 


GAATGACCTT 


CTCTTGATTA 


CTGCGGACCA 


TGGAAATGAC 


CCAACGTATG 


60 


CAGGAACGGA 


TCACACTCGG 


GAATATATTC 


CATTGTTGGC 


CTATAGCCCT 


GCCTTTAAAG 


120 


GAAATGGTCT 


CATTCCAGTA GGACATTTTG 


CAGATATTTC 


AGCGACTGTT 


GCCGATAACT 


180 


TTGGTGTGGA 


AACTGCTATG 


ATTGGGGAAA 


GTTTCTTAGA 


TAAATTGGTA 


TAAGATGACG 


240 


CGCTATGCTT 


TGCTGGTGAG 


AGGTATCAAT 


GTTGGTGGTA 


AGAATAAGGT 


CGTCATGGCG 


300 


GAGCTTCGTC 


AAGAATTGAC 


AAACTTGGGA 


CTGGAAAAGG 


TTGAGAGCTA 


CATCAATAGT 


360 


GGCAATATTT 


TCTTTACTTC 


GATAGATTCC 


AAAGCCCAAT 


TGGTTGAAAA 


GCTAGAGACT 


420 


TTCTTTGCAG 


TCCATTATCC 


ATTTATTCCA 


GAGCTTTTCT 


TTACTGAGTC 


TAGAGGACTT 


480 


TGAGGCGGAA 


CTTGAAAATC 


TACCAGCTTG 


GTGGAGCACA GACTTGGCAC GAAAAGATTT 


540 


TCTCTTTTAC 


ACTGAGGGTT 


TGGATGTGGA 


CCAACGTCAT 


CG 




582 



(2) INFORMATION FOR SEQ ID NO: 11: 
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<i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 698 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

CTGATTTTTT ACAATACCTG CGAAATCTTA GATTTCTTGT GCGGACATGG AAGAGTCGGC 
CAACGGACGT TGATTTGTAC CATCTGTAAT ATGAACAAAA CCTCGTACAG TTGGGATTCC 
ATAGCGTGAG CGGAATGCTT GCAAATCATT GAGTTGGCTT GGTTCTTCAC TATTGATGAA 
GTAAATGTGA GCTTTGGTTT CAGCTACGAC ACCTGACAAT GTACCTGCAA ATTTACGGCA 
GTAAGGGCAA GTTTTGCGAC CGATAAAGAA GGTTGCAGTT TCTTTTTTAT CAAGAGCTTC 
TTGCGCACGC ACAACTGTAG TGACTTCAAG GTCTTTGATG TTATCTAAAA ATTGTTCCAT 
GAGATTACCT CGCTTTCATT GATAAGTCTA GTATGCCATA AAGTTTCTAA AATTGCTTAG 
ATTTGATACG AAAAAAAATG AGGTTGGTTG GTCTCATCTT TTATAGGTCT TTATTTTACA 
AATGCATTGA TTTCTGCTTC GATGTTAGCA ATCTTAGCTT GTGATTCTTC GTTGGTTTCC 
CCTACAACTG CAATGTAGAA CTTGATTTTT GGTTCTGTAC CTGAAGGGCG AACGGCAATC 
CATGAACCGT CAGCAAGTTG TGTATTTCAA CACATCACTT GGAGGAGTTG TCAAGTTTGT 
AACAGTACCG TCAGCAACAG TAGCAGTTTG TGCCTTGA 

(2) INFORMATION FOR SEQ ID NO: 12: 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
698 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 707 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

TCAAGNGGAT TCGTTTCGGC GGAGNTATGA CGACGGGAAA AACCTTGAAC CAATTTTTGA 
TTGAGATGGA TGGTTTTGAG GGAAATGAAG GGATTATTCG TCATCGCTGC GACAAACCGT 
TCAGATGTAC TTGATCCTGC CCTTTTGCGT TCCAGGAAGT TTTGATAGAA AAGTATTGGT 
TGGCCGTCCT GATGTTAAAG GTCGTGAAGC AATCTTGAAA GTTCACGCTA AGAACAAGCC 
TTTAGCAGAA GATGTTGATT TGAAATTAGT GGCTCAACAA ACTCCAGGCT TTGTTGGTGC 
TGATTTAGAG AATGTCTTGA ATGAAGCAGC TTTAGTTGCT GCTCGTCGCA ATAAATCGAT 
AATTGATGCT TCAGATATTG ATGAAGCAGA AGATAGAGTT ATTGCTGGAC CTTCTAAGAA 
AGATAAGACA GTTTCACAAA AAGAACGAGA ATTGGTTGCT TACCATGAGG CAGGACATAC 
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CATTGTTGGT CTAGTCTTGT CGAATGCTCG CGTTGTCCAT AAGGTTACAA TTGTACCACG 540 

CGGCCGTGCA GGCGGATACA TNATTGCACT TCCTAAAGAG GATCAAATGC TTCTATCTAA 600 

AGAAGATATG AAAGAGCAAT TGGCTGGCTT AATGGGTGGA CGTGTAGCTG AAAAAATTAT 660 

CTCTAATGTC CAAACTACCA GGAGCTTCAA ACGACTTTGA ACAAGCC 707 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 571 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

CTCAATCAAT TCTTCATTAG TAGTTGATTG ATGCAGTTGA CTTTCGATTT GTTCACTCTT 60 

GCGGTCGATT GAACGAAGGG CTGTTAGGTA AAGTCTGCAT TGCGATAAAG AATTTGAAAG 120 

ATAAAACGTG AACGCATGAA GGTATAGAAA TTACGCAATC GACGGTTGAT AAAGACATCA 180 

AGGACAGGTA GTGGTTCCAA ACACGTAGTG ATAATGGTTT CCTCAGTGAT GATAATACCA 240 

AGCGGGATGG TTACCTAGTA GGTGCGGTTA TTTCTTTCCT CCGTGACCGG CACGTCTACG 300 

ATAATCAGGG TATACTCGTC CTCAATGGTA ATACGACACA TTTCTTCCGC ATCGAGCGGT 360 

GCTCGAAGGT CGGCAATATC AATATCGAAG GTGTTGGCGA TTTCGAGTGA TTCATTTTGA 420 

GTCGGATTGA CGAGATTGAT CCAAGTACCC GGTTCAAGCG TATCGATCTC TTTAAATTCA 480 

GTTGTTGTAG AGAGAAAAAC TTGTTTCATA GCCCTTAGCC TTTCTCATTC TTCAGATTTT 540 

TTCACACTGT ACTATTATAC TACAAAATCG G 57! 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 761 base pairs 

(B) TYPE: nucleic acid- 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

TTGNGGAGAG ACTTNAGTAT TTTGTGGCAG ACAGACCTCG ATAATCACCA CAGGCACTAG 60 
GAACCAGTGG ACACTGCAGA GTGAGTGCTT CACCACTGCG CATAGATTTA CCAAGCAACT 120 
TCTTTGCAAT AGCACCAGCC ACGAACCTGC TACCAAACTG TTTTTCAGCA TAGAGTAGGC 180 
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AAACATAAAC CCGCAAAGGA CTGGAAGCAT CAAACCAAAG GCAGCTCCAC CAATTTTCAT 240 

GGAACATAGA AGCTAAACTC ATGGTAAGAA CCAAGATTGC CAAGGTTTTC ATTTGGAACA 300 

CTCAAAGCAC CGTCAATCAA GAAGGCAAGG GCAATCATGA TACCACCACC GATAACGAAT 360 

GGTAACATTT GAGATACACC ACTCATCAAG TGTTTGTAGA AGGCACCACC AAGGCTTTGT 420 

TTTTCGTTAG AGGCTGTTGC GACTTTGGCA CCATTAGCGG CACGGTAGAC TTCCGCATCT 480 

CCTGAAATAG CCAAGTTGAT CAATTCTTCT GTCTTACGGA TACCGTCAGC AACTGGACGA 540 

TTGATCAAAG GTTTGCCATC GAAACGATCC ATTTCAACGG CCTTGTCTGC TGCAATGATA 600 

ATAGCTTTAG CCTTACGGAT ATCTTCTGCA GTTAGTTGAT TTCCAACACC GCTAGCACCG 660 

TTGGTTTCGA CCTTGATACC AACCCCCATT TCAGCAGCTA CTTTTTGAAG GGCTTCTTGG 720 

GCCATGTAAG TGTGGGCAAT ACCTGTTGTA CAAGCTGTAA C 761 

(2) INFORMATION FOR SEQ ID NO: 15: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 693 base pairs' 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 



CTCTAAAACG AATTCTCTAC AAGCACCGCA AGGGCATGGC TGAACTTCCA CCATAAGGTG 60 

GTTTGTCTCG AAAGGCTAAT ACTTCNTTAA CCTTAGTTTG TCCTGAAAAT TGGTACATAT 120 

TGAAGAGGGC CGCCCGTTCT GCGCAGAGAT GGAAAACACC ACAGGTTCCC TCCATACAGA 180 

ATCCTGTAAA TATTTGTCCA TCTCCTGCTT CTACTGCAGC TACAACATGA TTGGCATAAA 240 

CAAAGTCTGA TACTTCATGT GGATTGTATA GTTTCTGTGC TTCTTCGTAC ATCTTTTCCC 300 

AGATGTCCAT TATTGTATCC TCTTTATTTA GAGATTTCTT TTAGAATGTT TTCGATATGC 360 

TGAATTGATT TTTCACGTCC AAGCAAGAAA ATTGTATCTG GTAATTCTGG CCCATGCATT 420 

TCGCCTGAAA CTGCGATACG AATAGGCATG AAAAGATTTT TCCCTTTAAT ACCTGTTTCT 480 

TTTTGGACTG CTTTAATTTG CGGGAAGATA TTTTCTGTCA CAAATTCATC ATCTGTCATC 540 

CGCTTCAAGT TTTGCTTTGA ATGCTTCAAG AACTGTTGGA ACTGTTTCAC CCGTCATGAC 600 

TTCGCGCTCT GCTTCTGTCA ATTCCTGGGA AATCTGAGAA GAAAGATCTG TCCATGGGAT 660 

ATCTCATCTA CTGATTCATT GTGGTTTATA GAG 693 

(2) INFORMATION For --Q ID NO: 16: 



(i) SEQUENCE CHARACTER . .' *.TCS : 

(A) LENGTH: 713 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: Genomic DNA 

<xi> SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

CTGTATCTAG TGGCAACGCC GATTGGCAAT CTAGATGATA TGACTTTTCG TGCTATCCAG 60 

ACCTTGAAAG AAGTGGACTG GATTGCTGCT GAGGATACGC GCAATACAGG GCTTTTGCTC 120 

AAGCATTTTG ACATTTCCAC CAAGCAGATC AGTTTTCATG AGCACAATGC CAAGGAAAAA 180 

ACTCCTGATT TGATTGGTTT CTTGAAAGCA GGGCAAAGTA TTGCTCAGGT CTCTGATGCC 240 

GGTTTGCCTA GCATTTCAGA CCCTGGTCAT GGATTTGGGT TAAGGCAGCT ATTGGAGGGA 300 

AGAAATTGCA GTTGTTACAG TTCCAGGTGC CTCTGCAGGA ATTTCTGCCT TGATTGCCAG 360 

TGGTTTAGCG CCACAGCCAC ATATCTTTTA CGGTTTTTTA CCGAGAAAAT CAGGTCAACA 420 

GAAGCAATTT TTTGGCTCTA AAAAAGATTA TCCTGAAACA CAGATTTTTT ATGAATCACC 480 

TCATCGTGTA GCAGACACGT TGGAAAATAT GTTAGAAGTC TACGGTGACC GCTCCGTTGT 540 

CTTGGTCAGG GAATTGACCA AAATCTATGA AGAATACCAA AGAGGTACAA TTTCTGAATT 600 

GCTGGAAAGC ATCTCTGAAA CGTCTCTCAA GGGTGAATGT CTTCTGATTG TTGAAGGTGC 660 

CAGCAAAGGT GTGGAAGAAA AAGATGAGGA AGACTTGTTC TTAGAAATCC AAG 713 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 670 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

CTTTAAGGAA AATCAAATCT CTCATGCTGA TACCTCTCCT CATTAAATTA AATAGTAAAA 60 

AAGATTCTAT CTCACTCCCT GATTATTACA AAACCATTGA AATATCACAA CTAATAGGCT 120 

AGAATGGACA TAGTAAGATN TAGTAGATGA GTCATTCTAC TCAAATCCAC GTTAGAAAGG 180 

ACTGCTATGC CAGACAATCT CGCGCTTCGC ATGCGCCCTA AAACCATCGA CCAGGTCATC 240 

GGTCAGGAGC NTCTGGTCGG ACCTGGAAAA ATCATCCGCC GCATGGTGGA AGCCAACCGC 300 

CTGTCCTCCA TGATTCTATA TGGCCCTCCT GGAATCGGCA AAACCAGTAT TGCCTCTGCC 360 

ATCGCTGGAA CGACCAAGTA TGCCTTGCGA GACCCCCAAC GCGACAGTTG ATAGTTAAAA 420 

AGCGACTGCA AAAAATCTCG GAAGAAGCTA AATTTTCTGG TGGTCTCGTC CTATTGCTAG 480 

ACGAAATCCA CCGACTAGAT AAGACCAAGC AAGACTTCCT CTGGCCTCTC TTGGAAAGTG 540 

GACTGGTCAT CATGATTGGA GCAACGACTG AAAATCCTTT CTTCTCTGTC ACTCCTGCCA 600 

TTCGTAGCCG AGTTCAAATT TCCGAGTTGG AACCTCTGTC TAACCAAAAC GTCAAAGGAG 660 
GCCCCGCAAA 



670 
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(2) INFORMATION FOR SEQ ID NO: 18: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 575 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

CTCTTAGTAT CNTGATTATG GCGACTTATG AAACCTTCNC GGCGGTCTAT GATGCGGTCA 60 

TGGACGAATA GTTTATGCGA CAAATGGACG AATTTTTCTC ATGCGTCATT TGCCTAAGAC 120 

CGAAGGAGAG AAAGAAACTC TTGGAATTGG CTTGTGGGAC AGGAATTCAA TCAGTGCGCT 180 

TCTCTCAGGC TGGTTTTGAT GTGACTGGAC TTGACTTGAG TGCGGATATG TTGAAGATTG 240 

CGGAGAAGAG GAGCAACTTC AGCCAAGCAA AAGATTGCTT TTATAGAAGG TAATATGCTA 300 

AATTTGTCCA AGGCAGGGAA ATACGATTTT GTCACGTGTT ATTCGGACTC TATCTGCTAT 360 

ATGCAGGATG AGGTGGAAGT AGGGGACGTC TTTAAGGACG TGTACAATGC GCTGAATGAA 420 

GAAGGAGTTT TCATCTTTGA CGTGCACTCG ACCTACCAGA CAGATGAAGT ATTGCCCTGG 480 

CTATTCCTAC CATGAAAATC TCGGAAGATT TTGCCATGCT TTGGGATACA TATGAGGGCG 540 

GAGCTCCTCA CTCCATCGTG CATGAGATCG AGCCT 575 

(2) INFORMATION FOR SEQ ID NO: 19: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 507 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

CTTGTCTTTA ATTGATAATG ATCCTTTTAA AGATGATGAA GATTTCGTGT CAACCTATGA 60 

CCTTGATAAA TCTTTTATTT CTATGGTATC TGTTGATGTA TCAGAATACC TAGGAAGTCA 120 

GGAACCCATT AAAAAGACTT TGACCATACC AAAATGGGCA GATAAGTTGG GACGAGAAAT 180 

GGGACTTAAC TTTTCTCAGA CTTTGACAGA CGCTATTGCA GATAAGAAAG TTCAAGCCTA 240 

AGATTAAGCG TGGTATCGTG GTAATAGTCA ATAAAAAAGC ACGTCCACTT GTGCTAGTTA 300 

CTCGCCTATT GATTTTATAG ATTTATTAGC CCTTTCGAGG GCTTTTTATA TTGATTTTTA 360 

TGAGAAATAA AGAAATCAGA CTTTTAAGAA AATCATTGAT ACCAAGGGTT TAAATGAGGT 420 

AATATGGTAT AATTAGGACA TAAAATAATT TTGTGGTAAG ATGGTAGTAT CTATTTTAGC 480 
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507 



CTCGTACAAA CGGTATGGGA TTTGCACGTG ACAAGATGGT TATTACCAGT GTAAACAAAA 
ACCTAGAATT GACAGCTAAA TGGATTGATG CACAATACGC TCCACTCCAA TCTGTGCAAA 
ATAACTGGGG AACTTACGGA GATGACAAAC AACAAAACAT CTTTGAATTG GATCAAGCGT 
CAAATAGTCT AAAACACTTA CCACTAAACG GAACTGCACC AGCAGAACTT CGTCAAAAGA 
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ATATTTCCGA GCAACGGGGC GATTAAA 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 666 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20: 

CTTTGACAAA GAGTATGACA CCTGACCGTG AAGTCATTAC CTTTATTCCT GAAAAATTTA 
TTGTGGATGG TTTCCAAGGG ATTCGTGACC CACGTGGCAT GATGGGGGTT CGCCTTGAAA 

TGCGTGGTTT GCTTTATACA GGACCTCGTA CTATCTTGCA CAATTTGCGT AAGACGGTTG 180 

AGCGTGCAGG TGTTCAGGTT GAAAATGTTA TCATTTCACC ACTAGCAATG GTTCAGTCTG 240 

TTTTGAACGA AGGGGAACGT GAATTTGGTG CTACAGTGAT TGATATGGGG GCAGGTCAAA 300 

CGACTGTCGC TACAATCCGT AATCAAGAAC TCCAGTTCAC ACATATTCTC CAAGAAGGTC 360 

GAGATTATGT AACTAAAGAT ATCTCCAAGG TTTTGAAAAC CTCTCGCAAA TTAGCGGAAG 420 

GCTTGAAACT GAATTACGGG GAAGCCTATC CGCCTCTTCC AAGCAAAGAA ACCTTCCAAG 480 

TAAAGGTTAT TGGAGAAGTA GAACCAGTCA AAGTCACGGA AGCCTACTTG TCAGAAATTA 540 

TTTCTGCACG AATCAAGCAC ATCCTTGAAC AAATCAAGCA AGAATTAGAT AGAAGGCGTC 600 

TGGTTGGACC TCCCTGGTGG TATTGTCTTA ATCGGTGGGA ATGCCATTTT ACCAGGTATG 660 
GTTGAG 

666 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 703 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ 4 IDNO:2l: 
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CTGAAGTAGG AGGACCACTA GCTATCCTAG ATTCATACTA TGGTAAAGTA ACAACCATGC 
CTGATGATGC CAGAATGGCG TTTGGATCTT ATCAAAGAAT ATTATGTTCC TTACATGAGC 
AATGACAATA ACTATCCAAG AGTCTTTATG ACACAGGAAG ATTTGGACAA GATTGCCCAT 
ATCGAGGCAG ATATGAATGA CTATATCTAC CGTAAACGTG CTGAATGGAT TGTAAATGGC 
AATATTGATA CTGAGTGGGA TGATTACAAG AAAGAACTTG AAAAATACGG ACTTTCTGAT 
TACCTCGCTA TTAAACAAAA ATACTACGAC CAATACCAAG CAAACAAAAA CTAGAGGTTG 
ATTATGGGAG ATAAGAAATA CACAGTAGAA AAAGCCAATA GTTTTATAGC AGAAAATAAA 
CATCTCGTTA ATACTCAATA TAAGCCTGAA GGACATTTTT CAG 

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 673 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



300 
360 
420 
480 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

CCACAACAAC TCTCGTNTGA TCACTATGGG GGCACAACTT GTTGGTGATG AATTGGCTAA 60 

AAACATCGCT AAAGGATTTG TTAATGGTAA ATACGACCGT CNACATCATC AAATCCGGGT 120 

TGACATGTTG AACAAAATGG GCTAATTAGA TTACAGTAAG AAAGGTAAGT TAAAAATGAG 180 

AATTGCAATT GGATGCGACC ACTTCGTAAC AGATGAAAAA ATGGCGGTTT CAGAATTTTT 240 

GAAATCAAAA GGATATGAAG TCATTGACTT TGGTACATAT GACCATACAC GGACTCACTA 300 

CCCAATCTTT GGTAAAAAAG TACGGGAAGC TGTAACTATC GGCCAACCTG ATCTTGGAGT 360 

ATGTACCTGT GGTACTGGTG TTGGTATCAA CAACCCTGTA AATAAAGTCC CACGTGTTCG 420 

CTCTGCCTTG GTTCGTGATA TGACAACAGC CCTTTATGCT AAAGAACAAT TGAACGCCAA 480 

CGTTATCGGT TTTGGTGGTA AGATTACTGG TGAATTGCTC ATGTGCGATA TCATCGAAGC 540 

TTTCATCCAT GCTGAATACA AACCATCTGA AGAAAACAAA AAATTGATTG CGAAAATTGA 600 

ACATGTTGAA ACAffACAATG CTCAACAAAC AGATGCAAAC TTCTTCACTG AATTNCCTTG 660 

AAAAATGGGA TCG 673 

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 508 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 



CTACTTTCCG 


CCAGAGTTTT 


GAGGTCGCTA 


GAACATTTTG 


CTTATCGATA 


CTAGTAACGA 


60 


TTTTTCTGCG 


ATTTCTTGCA 


ATTTCAAAGG 


CTTTGCGAAT AATCCGCTCC ACTTCCTCAT 


120 


AGCTATAGTC 


GTTGATATCA 


CGCGCTTTGC 


GCTCTTCAAG 


AATATGATCT 


CCAAAGTAAA 


180 


TCTCGCCTAA 


TCAATTAACG 


CACCACGACA 


AAGTATACAC 


CAGAAATTCG 


TTCCGGTTTG 


240 


AGTGGTGACA 


AATACTTGAG 


ACTGTCAAAG 


ATTTTTACAG 


GACGAATATT 


AGCGTAAAGA 


300 


TTGAGTTCTT 


ACGGAGAGCC 


ATCAGGCCTT 


GTTCAGGGCG 


AACCGCTACT 


CCATCATACT 


360 


GAGGACTACC 


GATAGCTACT 


AGTAGGATAG 


CATCTGCTTC 


CCTACTTGCC 


TTAAGGGTTT 


420 


CATCAGGTAA 


GGGAGGTCCT 


GCTGCATCAA 


TATCTGCACC 


TCCGAACGGT 


CGTCTGTCAA 


480 


TCTNCATAGT 


CAAAACCTGT 


TTTTTCAG 








508 



(2) INFORMATION FOR SEQ ID NO: 24: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 499 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 



CTCTTGTCAG AGAAATTTAC AAAACGTTAG GAGAATAAGA TGGCATTTAT TGAAAAAGGT 60 

CAAGAAATCG ATATGGAAGT CATCAAGGCT GAAACCCAAT TGTCTGCAGA AGCCTTGAGA 120 

CTCAAGGAAA GCCGTGACAG GGAATTGGCA GATATTATTT CAGGGGAAGA TGACCGTATT 180 

CTCTTGGTGA TTGGTCCTTG CTCTTCTGAT AATGATTAGG CGGTCTTGGA ATATGCTCGC 240 

CGTTTATCCG CCTTGCAAAA GAAGGTAGCG GATAAGATTT TCATGGTCAT GCGCGTGTAT 300 

ACTGCTAAGC CTCGTACCAA TGGAGACGGC TATAAAGGGT TGGTTCACCA GCCAGATACT 360 

TCTAAGGCTC CAACCCTGAT TAACGGCTTG CAGGCTGTGC GCCATTTGCA CTACCGCTTT 420 

GATTACAGAT ACTGGTTTGA CAACGGCAGA TGAGATGCTT TATCCGTCAA ATCTGATCTT 480 

GGTGGATGAC TTTGGTCAC 499 



(2) INFORMATION FOR SEQ ID NO: 25: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 354 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 



CTTGTCATAA TTTACAATTT AGTTAATAAG ACCAAAAAGG TCATATATAT AAAGGAGTCA 

CAAAAATCAT GAACAAACGT GTAAAAATCG TTGCAACTTT GGGTCCTGCG GTAGAAATCC 1 

GTGGTGGTAA AAAATTCGGT GAGGACGGAT AC TGGGGTG A AAAACTTGAT GTTGAAGCTT 1 

CAGCTAAAAA CATTGCTAAA TTGATTGAAG CTGGTGCTAA CACATTCCGA TTCAACTTCT 2 

CACACGGCGA CCACCAATGA ACAAGGTGNA GCGTATGGCA ACTGTTAAAC TTGCGGAAAA 3i 

AATTGCAGGT AAAAAAGTTG GTTTCCTTCT TGATACAAAA GGACCTGAAA TCCN 3! 

(2) INFORMATION FOR SEQ ID NO: 26: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 540 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

CTTCTTTACT GAGATGGATT TTCCTTTTTA GTTCGTGCCA ATCTCAAGGC ACGATTTGGA 60 

TTGAGACGAT TAAATAGTTC TTCCAATTTC TTTTCATTCC AAACGTCTGC GGCGGAAACA 120 

AAATTACCAT CCGCATCTCG GAAAGATATC GGTTTATCTC CCATACCAGT CTCCTAGTCT 180 

ACAAATTCAA ACTCAAATTT ACCAATGCGG ACCAAATCCC CATCTTTAGC TCCACGCGCA 240 

CGAAGGGCTT CATCAACCCC CATACCACGA AGCTGACGGG CAAATTTCAT GACAGATTCA 300 

TCACGATCAA AGTTGGTCAT ATTAAAGAGT TTCATGAGTT TTTCACCAGA AAGTACCCAT 360 

GTCGCATCGT CATCACGACT AATTTCAAAG GCTTTTTCTT CTTCGTCAAA TCCATAGTAA 420 

GCTTCTTCTT CCATATCAGA CTCGTCGTAG AGCAAAAATT CTGGTGTCTT GTCTAACAAT 480 

TCAGCTGTAG CATCTAAAAG TGTTGCCAGA CTTGCTTGGT CAATCCAGAA ATTGGGAAGA 540 

(2) INFORMATION FOR SEQ ID NO: 27: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 333 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 



GACTAAAGGT GCAGGTGCCA GATGATCGTC ACCAACGTAT CAGATAGTCA ATTACTTTCC 60 

CAAATCTTAC TCAATCTTTT ATCCAATGCC ATCCGTTACA CTGAACAAGG GGGAAAAATT 120 

GAGGTCAAGA CCCAAAAGGT GAACGAAGGC ATTAAGATTT CTGTATCAGA TACAGGGCAT 180 

TGGTATTAGT CAATTAAAGC ATGATCGTAC CCCTGACCGT CACTACCGAG TTACCAAAGG 240 

TCGAAACTGA GATTCCGGTT AGCCACGGTG GTGGCCTTGC CATTGTCAAA GAACTCAGTC 300 

AATTTATTAG GTGGCCGNAG TCACGGTGAC GAC 333 

(2) INFORMATION FOR SEQ ID NO:28: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 272 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:28: 

CTGAACATAG GCTTAGAAGG AGTCTACCAG CAGGAGAATG CAGCCTTGGC GTTGCAAACT 60 

TTTCTTCTCT TTATGGGAGA AGGANNGGAA GCTGTCGATG AACAGGCTGT AAGAAAGGCC 120 

TTGGAACACA CCCATTGGGC TGGTCGCTTG GAGCGTATTC GCCCACAGAT TTACTTGGAT 180 

GGTGCTCATA ACCTCCCTGC CTTGACTCGC TTGGTTGACT TCATCANCCA ANAANTATCA 240 

TGAGGGTCAT CGTCCTCCAA TCCTTTTTTG GA 2?2 

(2) INFORMATION FOR SEQ ID NO: 29: 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1159 base pairs 

(B) TYPE: nucleic acid- 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

CTGCATCAGC CAAAGCTTTC ACACGTCCGT GATATAGATA TCCACCGCGG TCGAACACCA 60 
CTTCTGAAAT ACCTTTAGCG TTTGCACGTT CTGCAACGAG TTTACCGACA GCAACGGCTT 120 
GTTCAGTTTT AGTTCCTTTT GAAACTTCTT TATCAAGAGT TGAAGCACTT GCGAGCGTTA 180 
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CACCCGCTAC GTCATCAATC ACTTGAGCGT AGATGCCTGT ATTAGAACGG AATACGTTCA 240 

AACGTGGGCG ATCAGCAGTT CCAGAGAGTT TTCCGCGAAC GCGACGGTGG CGTTTTTGGC 300 

GGAGTTTGTT TTTATCTGGT TTTGAAATCA CAGTTTTCAC CTCTTTAGTT TTAAATCGTG 360 

TGCTATGCAC AAAGTTGGAA AATAGGTTGG TGGTTGATGA TCAACCACTC AACATTATTT 420 

ACCTGTTTTA CCTTCTTTAC GGCGAACGAA TTCACCAACG TAACGGATAC CTTTACCTTT 480 

ATATGGTTCT GGTGAACGAA GGCTACGTAC GTAAGCAGCT GTTTGACCAA CTACTTCTTT 540 

TGAAATTCCG CTAACAACGA TTGTTGTTGG GTTTGGAAGT TCAAAAGTAA TTCCTTACAG 600 

CTTCAACTTC GTCTGGATGA GATTTACCAA CAGCCAAAAC AAGTTTAGAT CCTTGAAGCT 660 

GTGCACGGTA ACCAACCCCA CGCATTTCAA GTTCTTTCTT GAATCCTTCT GATACACCAA 720 

CAACCATGTT GTTCAAAAGG GCACGAGTAG TTCCGTGGAT AGTTTTCATT TCTTTTGAAT 780 

CGTTTGGACG GTGAAAAGTT ATTTCAGTAC CTTCCACACG GATTTCAATA TCTTTTGAGA 840 

ACTCACGAGT AAGTTCTCCT TTAGGTCCTT TTACAGTTAC AACGTTGTCA TTGTTAGCGA 900 

GTTCAACACC AGCAGGCAAC ACGATAACTT TATTACCAAT ACGTGACATG TTTATTTTCT 960 

CCTGTTAAAT TGTCAGGCCA GAACGGCCAG TTTTCACGGG GTTCAAATAC TTATTTAGTT 1020 

CAAGAGCCAT CCAAACTCTC CCAGCAGACG ATGACCACCC AGTAGATACC ATCGCCCAAG 1080 

TCTTTCAAAA AGGCTACAAA CTCCATGAAC GCATCCCTAC GCCCAGCCAA TGGTAGTGGT 1140 

GTTATAACTA AGAATACAA 1159 

(2) INFORMATION FOR SEQ ID NO: 30: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 901 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 



CGATCGGACA 


AGAATCTATT 


CTGTAAATTG 


CGACACGAGG 


AGTCGAAATC 


GATTTTACTT 


60 


CGACGACGAG 


TTAGTAAGGA 


AGCTAGACAA 


ACGCCATAGC 


GATTGCCGTT 


TTCTGACGAG 


120 


TTGCTTTAGC 


TACCGTCAGA 


ATTGCCGAAT 


CGAACACGCC 


CTAAGCTCTG 


AGTAAAAAAG 


180 


ATAGATTTTC 


TTGTGTTCAT 


CGAACAATTC 


GTCAATCTCC 


TATTTTTACT 


GCGAGCTTTT 


240 


TACGGGCTTT 


GTATCTTAGT 


TATACACCAC 


TACCATTGCT 


GGGCGTAGGA 


TGCGGTCATG 


300 


GAGTTTGTAG 


CCTTTTTGAA 


AGACTTGGGC 


GATGGTATCT 


ACTGGGTGGT 


CATCGTCTGC 


360 


TGGGAGAGTT 


TGGATGGCCA 


TATGGTAGTT 


ATGGTCAAAT 


TCGCCATCTG 


CTGCGATTTC 


420 


TTCAATTCCT 


TCTTCTTTCA 


AAGCGTGAAT 


CAAGCTTTCT 


TGCACCATCC 


CCAAGCCCTT 


480 


CTTCACATCA 


TCTGTCAAAC 


CTTCAACTGC 


AAGTGCACGC 


TCAAGGTTGT 


CAAGAGATGG 


540 


TAAGATTGCT 


TTTGCCAAGT 


CCTGGCTACG 


ATAACGTTGC 


AAGTTTTGAC 


GTTCTTCATT 


600 


GGCACGGCGT 


TGGATATTTT 


GCATTTCTGC 


ATGAGCGCGA 


AGATATTTGT 


TTTCGAACTC 


660 


ATCTGCACGT 


TCATTTGCCA 


AGTCCAACTC 


AGACTTTTCA 


GGAGTTGTTT 


CTTCAGCTGT 


720 


TTCCACAACT 


TCCTCTTCTT 


GAACTTCTTC 


TACTTCTCCA 


TTTTTTATAT 


CTTGGGCCAT 


780 
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GTTCGCCTCC CTTTAATGAT TTCAATCTTA ATGTACTTCA TAACGATTAC TGCTGAGGTA 840 
GCGGTAAAAA TCTGTCNNCN TCATAGTCAA AACACGGTGA ACACATTGAC TNGATTGATA 900 
A 901 

(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 766 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:31: 

CTNTTGTCAA AAGTATTGCT ATCCTTATTC ATAGAATAAA GAATACGGCG CTGAACCGGC 60 

TTCAACCCAT CACGAATATC TGGCAAAGCC CGGTCTTGAA TAATGTACTT GGAGTAGCGA 120 

CCAAAGCGCT CTCCCATGAT GTCCTCCAGG GACATGTTTT GAATGTTAGA CATAAGATAC 180 

AAAGCCCATA AAATACCAAG TGAAAATAGA AAATTCTTGA AGTAAGCAAA CTCACAAGAG 240 

AATTTATCTT TTTCACACAG TATCTAGGGC GTGTTCAACT CCTTTCAAAG AATGTAGAGT 300 

AGGTTTTTAT GCAGTAAAAG ATATTTTACG GGAATTCGTC CCGTGTTCAG TTACGATAAG 360 

TAACCAAACT ATCCTGTTTG TATTTTTCAA TATGAAAATA TGGTTTTCCA AAATTAGTCT 420 

TAGTTTGTGT CTTAGCCGCT CCCTTAAACG CCTCTTTGAG ATAAGCACTC ATAGCAGATT 480 

CTTCATTAAT AATCCTGCAA TTTTTTCAAA CCAAGATTTT CAAACTGCTT TTTCACATAG 540 

TCATTCACAT CCGACTCTAA TTTCCAGTTT ACTAACATAT TATTTTCTTT CATTAAAACA 600 

CTGTCGCTTC TTCTAGCGTA AACTTGACAT TATCTTCAAT CCATTTACGG CGTGGTTCTA 660 

CCTTATCTCC CATGAGAACA TTGACGCGGC GTTCGGCGCG CGCTAAATCT TCAATTGTGA 720 

CACGGATGAG GGTACGTGTT TCTGGGTTCA TGGTTGTTTC CCAGAG 766 

(2) INFORMATION FOR SEQ ID NO:32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 831 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:32: 

CTCGTGTTCA AATTGGCAAG CCCAATTGAA CCCGGGCTAA ACTCTGTGTG AAAAAGATAA 60 
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ACTTTCCTAG AAACTTTAGT TTCTTCGTCA AGTTTCCTAT TTTTACTTGG AGTTTTGACG 120 

CTCTTGATAT CTTAAATTAC CAAGATGCTT TTGTTACACC AGGAATTTGA CCTTTATGCG 180 

CAAGTTCGCG AAAAGCGATA CGACTCAGAC CAAATTTGCG GTAAACTGAA TGTGGGCGCC 240 

CCGTAACTCT ACAACGATTA TGTAAACGAG TCGGTGAGGC GTTGCGAGGT AATTTAGATA 300 

AACCTTCGTA GTCCCCTGCC GCCTTTAATG CAGCACGTTT TTCAGCATAA CGGTCAACAA 360 

TTTTTTGGCG TTTAGCCTCT CTAGCTACCA TTGATTTTCT AGCAATTAGA TTTACCTCCT 420 

ATATTATTTT GCAAAAGGCA TTCCAAGGCC TTGTAAGCAA TGCACGTGAC TCTTCGTCAG 480 

TGTTAGCAGT TGTTACGATA ACGATGTCAA GACCACGAGT TTTGTCAACG TCATCGAAGT 540 

TGATTTCTGG GAAGATTAAT TGTNCTTTCA CACCAAGTGT GTAGTTCCCG CGTCCATCAA 600 

ATGATTTTGT TGGAACACCG TGGAAGTCAC GTACACGTGG AAGTGAAACT GATACCAATT 660 

NACCCAAGAN TTCGTACATA CGTTCACCAC GAAGGGTAAC TTTTGCACCG ATCGCNACAC 720 

CTTCACGAAG ACGGAAGCCG GCGATTGATT TNTTAGCTCT AGTGATNAGT GGTTTTTGAC 780 

CNGAGATAAG TGCCAATTCT CCAGCAGCTT TTTCAGGCTT TTAGCGTTTG A 831 

(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1194 base pairs 

(B) TYPE: nucleic acid 
(C> STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 



CTACTCGTAC AGGTGCACAA GTTGCGGGTC CAATCCCACT TCCAACTGAA CGTAGCCTCT 
ACACAATCAT TCGTGCGACT CACAAATACA AAGACTCTCG CGAACAATTT GAAATGCGTA 
CACACAAACG TTTGATCGAT ATCGTTAACC CAACTCAAAA AACAGTTGAT GCCTTGATGA 
AATTGGATCT TCCAAGTGGT GTAAACGTAG AAATCAAACT TTAATCTAAA ATATAAAAGA 
GCAGAGGCTG GTGTTTCAAT CTAATTGAAC ACGGACTAAA CTCGGTGTGA AAAAGATAAA 
CTTCCTATTG TCCGTTAGAC ACTGCGTTAG TTTCCTATTT TCACTTTGAG TTTGACGCCC 
TTTGTATCTT AGACTTGAGC ATAAAAAACG CTCGTTAAAA ACTTTTTGAA TAAAAAATAT 
AGAAAAGGAA CTATTTTCTC ATGACAAAAG GAATCTTAGG GAAAAAAGTG GGAATGACTC 
AAATCTTCAC TGAAGCTGGC GAATTGATCC CTGTAACAGT TATTGAAGCA ACTCCAAACG 
TTGTTCTTCA AGTTAAAACT GTTGAAACAG ACGGATACAA CGCTATCCAA GTTGGTTTCG 
ATGACAAACG CGAAGTATTG AGCAACAAAC CTGCTAAAGG ACATGTAGCG AAAGCTAACA 
CGGCTCCTAA GCGCTTCATT CGTGAATTCA AAAACGTTGA AGGCTTGGAA GTTGGCGCTG 
AAATCACAGT TGAAACATTC GCAGCTGGAG ATGTTGTTGA TGTAACTGGT ACTTCTAAAG 
GTAAAGGTTT CCAAGGTGTT ATCAAACGCC ACGGACAATC ACGTGGACCA ATGGCTCACG 
GTTCTCGTTA CCACCGTCGT CCAGGTTCTA TGGGGCCTGT TGCACCTAAC CGCGTATTCA 
AAGGTAAAAA CCTTGCAGGA CGTATGGGTG GTGACCGCGT AACAATTCAA AATCTTGAAG 
TTGTACAAGT TGTTCCAGAA AAGAACGTTA TCCTTATCAA AGGTAACGTA CCAGGTGCTA 
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AGAAATCTCT TATCACTATC AAATCAGCAG TTAAAGCTGG TAAATAATAA AGAAAGGGGA 10 

AATCAGTCAC AATGGCAAAC GTAACATTAT TTGACCAAAC TGGTAAAGAA GCTGGCCAAG 11 

TTGTTCTTAA CGATGCAGTA TTTGGTATCG AACCAAATGA ATCAGTTGTG TTTG 11 

(2) INFORMATION FOR SEQ ID NO: 34: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1526 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 



<ii) MOLECULE TYPE: Genomic DNA 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 



CTCCTCACTA 
AGCACTTAAT 
CTCCTGTTTC 
CCAGATGCCC 
TCATAAGTCC 
TTTGAGCAAT 
CTAGCATGCT 
GATTTTCTTT 
CAAGGTCTGA 
GCTGCAAGGC 
CATGATCCTT 
AGGCTTTTGA 
CATGCAATTC 
TGAAATATAG 
AAATTCGCAA 
TTGCATGTTA 
TTCAATTTTT 
ATCAGGTTCT 
GGACATGGCA 
AGATTTCTCC 
ATAAGCCACG 
TACACGCTTG 
GAGGACTTGG 
TAGAAAAGTT 
TAATTGAATC 
GACTGAAAAT 



TCTACACTGC 
CGTACTCTTC 
TAGGTAGACT 
TATGATATAA 
AATGATTTCC 
ACTAATAGCT 
GAGGGATAAT 
TAGTTGCAAA 
ACAAGAAGAC 
TAACAAACGG 
ATTGATAATT 
AGCTGTTTCA 
TAAGTCAAAT 
TCTTCTGTGC 
ATTTCTCCTG 
TGGGTAACAA 
AAAPTGGAGA 
ACTGCTAAAG 
CTCTTTTTAA 
ACTAAGGCAT 
TTATCATAGA 
CGTAGCTGAT 
CCTTCAATAT 
GATTTGCCAC 
GAAATATTTT 
GTTCCCATAC 



TTAAATTGGT 
TCTTTTAAGG 
AGGCGTTCAC 
AGGTATTGTG 
TTTGATAAGG 
TTTGAGGCTT 
TTACCCATTT 
ACAGCTTTGG 
ATGATGCTAA 
GCACAGGTCA 
AGCTTTGCCA 
AGGACAAGTT 
TGATTTCTCA 
GCTGATCTTT 
TTAAGAAAAA 
TGATAATCGT 
TAGGGTCTAA 
CTCGCGCAAT 
GATCATCTTT 
CTAATTGTTT 
TAGACATGGC 
TAAGGTTGAA 
GGCAAGAAGG 
AACCAGATGG 
TTAAGGCTTG 
TAGAAG 



CAAATGAACT 
ATTAGTTGAA 
AAATGTTAGC 
TTCCATTGGG 
CATAATAATA 
GGTGCAAAGG 
GGTGTAACTG 
CAATGCCTGC 
TCACAAATCG 
ATTCGATAGC 
TCTCCTTGTC 
GCCCTAGTCC 
TAGGTTTTTC 
TGGATTGGTA 
AGCAGTTTTA 
ATAATCCTTT 
GGCTGAAGTC 
GCAAAGGCGT 
GACTTCTTCC 
TTTGTCTCGA 
AAAGGGATTA 
TTTGCTACTA 
AACCAAATCG 
GCCTATCAAG 
AAAATCCCCG 



TTTTTACAAC 
TTAATTCAAA 
AATGTAATCA 
AATTGAGGTT 
TTGGTCAATC 
AAAGGCAACC 
TTCTTCGCCA 
CATATGGTCT 
AAGGTCAGAC 
GCTTTGACCT 
TTTGGAGGCT 
TAAAAAGGAT 
CTTCCTTATC 
AACACGTCAA 
TCTGAAATAC 
TTTAGTTGCT 
GGCTCATCCA 
TGCTGCTGAC 
CAAATGGCTG 
ATACCATGTG 
GGCTGTTGAA 
TAAATATCTT 
TTCATCCGAT 
GCAGTAATCT 
TAAAATAAGT 



CATAAAAAAG 
TCCACTAGTT 
GCGAAGCGTT 
TCTTGGTCTT 
TGTTCATCTT 
AATAAATCGG 
GGGGCTAGTT 
CCCATACGTT 
ACTTGTGGCT 
TGGTTGATAG 
AAGGCCAGTA 
TGTTCTAATT 
CGAACCGTCC 
CGGTATCTCC 
GTGAAGCTTG 
GAATGAGGTC 
TTAACAGAAT 
CGCCAGATAA 
CCCCTTTTAA 
TCCTTGGACC 
AAACCATCCC 
GCTCATCTAA 
TAAGGGTTTT 
GTCTTTCTGG 
CTAGGTGTCT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1526 
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(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1738 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

CTGGGGCAGA CGCACCGAAG ATGTCAATAC CGATGNCCGC GCCATCCAAA CCAACATACT 
TGTACCAACT TTGGGTCGCT GCCATTTCAA TGGCCACACG ACGACGAGTC TTAGATGGTA 
AAATGTCTTC CTTGTAGGTA GCATCTTGAG CATCAAATAG TTCGGTTGAG GGCATAGATA 
CCACACGTAC TTTACCACCT TGTAAAACCA ATTCCTTAGC AGCTTTGATA GCTAGATTGA 
CCTCAGATCC TGTAGCAATG ATAATAGTAT CAAATCCCGG GGTATCATAC ACGACGTAGG 
CTCCTTTAGC GACCTTACCA AAGTCTGTCC CTTCTTCAAC TACCAAGTTT TGACGGGTTA 
AGACAATGAC AGTTGGAGTG GTGGTACTGG TCAAGGCATG ATGCCAAGTC GCTTGAGTTT 
CACGGGCATC CGCTGGACGG ATAACAGTCA AGTTTGGCAT TGAGCGTAAA CCTGCCAAAT 
GTTCAACTGG TTCATGAGTT GGACCATCTT CACCAACGGC AATTGAATCA TGGGTAAAGA 
CATAAGTTAC AGGCAACTCC TGAATGGCTG ATAGCCGAAT AGCAGCTTTG ACGTAGTCAG 
AGAAAACAAA GAAGGTTCCG CCATAAACTC GTAAACCACC ATGAAGAGCC ATTCCATTGA 
GGATTGTTCC CATGACAAAT TCACGTACCC CAAACTGAAT ATTGCGGTTT AATGGATTAT 
ATTTATCTTG TAAGCCATCT GCCTTGATGT AGGTCATGTT GGAGTGAGCT AAGTCTGCCG 
ATCCACCTAA GAAGGTTGGT AAAACTGCTG CTGTATTAAT AGCATCTTGG GACGAATTAC 
GAGTTGCTTG AGAGAAGCCA TTCTCATAGA CAGGGAAGTC TTTTTCAGTA ATGGTTACAG 
GGGATTTTCC AGTACAATAG CGTCAATCTC ACTAGCAACT TCGGGATAAG CAACCTTGTA 
ATCAGACACC AAACTAGCCC AAGCATCGTA TGCCTCCTGA CCACGATCCG CTACATTTGT 
CTTGAAATCA GAATATACTT CCTCTGGTAC TTCAAATGGA TCGTAATCCC ATCCCAAAAA 
CTTACGAGTT GCTCCTGTTT CTTCTGCTCC TAGTGGTGCA CCATGAACAG CATTTGTACC 
ACTTTTATTG GGTGAGCCGT AACCAATTAC CGTTTTCACT TCAATCAAAC ATCGGCTTGC 
CTGAAACTTT 7GCTGTTTCG ATAGCAGCAT GGATGGCTTC CAAGTCTGTT CCATTTTCAA 
CCAAGGCAGT ATGCCAACCG TAGGCATTGT AACGGTCACG AACACTTTCT GTAAAGGAAT 
CCTTTGTCTC ACCATCCAAG TTGATATCAT TTGAATCATA AAGAACAACC AACTTATCAA 
GTTTTTGCAA GCCTGCGTAT GAAGCTGCCT CGCTTGAGAC ACCTTCCATC AAGTCTCCGT 
CTCCACAGAT AACGTAAGTA TAGTGGTCAA AGATATTGTA GCCTTCACGG TTATATTTGG 
CTGCCAAGAA ACGTTCTGCT TGGGCAAAAC CAGTAGCAGT TGAAATCCCT TGCCCTAGAG 
GACCTGTCGT AGCATCAATC CCTGCCGTAT GACCAAATTC TGGGTGACCT GGTGTTTTTG 
AACCCCATTG ACGGAAGCTC CTTAATCTCA TCCATGCTGA CATCTTCCAA AACCAGAAAG 
GTGAAGAAGA GCATAAAGGA GCATTGAACC ATGACCTGCT GAAAGAATAA AGCGGTCG 



60 
120 
180 
240 
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420 
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(2) INFORMATION FOR SEQ 
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(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 810 base pairs 



(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

CTTGGGAGCT ATTTTGCTTA GTTTTCGCCT TATTCCTCAA CTTGGCTGGG GACGTGGTCT 
TTTTAGTTCC ATTTTTCTAG CGATCTCAGC CTTCTGTAAT GCCGGTTTTG ATAATTTAGG 
GAGCACTAGT TTATTTGCTT TTCAGACAGA TTTACTGGTC AATCTGGTGA TTGCAGGCTT 
GATTATTACA GGCGGCCTTG GTTTTATGGT CTGGTTTGAT TTGGCTGGTC ATGTAGGAAG 
AAAGAAAAAA GGACGTCTGC ACTTTCATAC GAAGCTTGTA CTATTATTGA CTATAGGTTT 
GTTGTTATTT GGAACGGCAA CTACTCTCTT TCTTGAGTGG AACAATGCTG GAACGATTGG 
CAATCTCCCT GTTGCCGATA AGGTTTTAGT TAGCTTTTTT CAAACAGTGA CGATGCGAAC 
AGCTGGCTTT TCTACGATAG ATT AT AC TC A GGCTCATCCT GTGACTCTTT TGATTTATAT 
CTTACAGATG TTTCTAGGTG GGGCACCTGG AGGAACAGCT GGGGGACTCA AGATTACGAC 
ATTTTTTGTC CTCTTGGTCT TTGCACAAAG TGAGCTTTTA GGCTTGCCTC ATGCCAATGT 
TGCGAGACGA ACGATTGCGC CGCGAACGGT TCAAAAATCC TTTAGTGTCT TTATTATCTT 
TTTGATGAGC TTCCTTGATA GGATTGATTC TGCTAGGGAT AACAGCCAAA GGCAATCCTC 
CCTTTATCCA CCTCCATATT TGAAACCATT TCCAGCTCCT TAGTACAGTT GGTGTTAACG 
GCAAATCTGA CTCCCTGACC TTGGGAAATT 

(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 2179 base pairs 

(B) TYPE: nucleic acid 
{C) STRANDEDNESS : single 
(D)^TQPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:37: 

CTCAACCCTG GCATGGAAAA TATCCATGTA GCGGCTGGAG ATTTGCTTAA GGGTGTGGAA 
ATTGAGGCAG ATGTGATTGT GGCTAATATC TTGGCGGATA TCCTCATTCA TCTAATAGAC 
GATGCCTATC GTTTGGTTAA GGACGAAGGC TACCTGATCA TGAGTGGCAT TATCAAGGAT 
AAGTGGGACA TGGTGCGCGA GTCGGCTGAA TCAGCTGGAT TTTTCCTCGA AACTCACATG 
GTTCAAGGGG AATGGAATAC CTGTGTCTTT AAGAAAACCA AGGATATCTC TGGTGTGATT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
810 
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GGAGGCTAGC 
GGACAAGGAA 
TACCTTAGTC 
TCAGTTTGAG 
CGCATCTGGA 
GGGTGCTAGC 
GAAATTGGGT 
CAAGCGTAAT 
GCTGGACCAG 
CGCTGCGCTT 
TGGTCCAGAA 
TTTGGCAGGC 
AGCCCTTAGT 
GTTCAGAATT 
ATTTCTTGCG 
AGGGCAATCT 
ACATTCAGAT 
ATGACGAATT 
CCTTGACCTT 
GTGAATTGCC 
TAGGGGCTGA 
ACATTTATAC 
AGACCTATAA 
GGTTCGGGAG 
CTGCAGCAGC 
ACGACTCTGT 
GCCAGATTGA 
TTGCCAAGGT 
GCGCCCTGAA 
AACCTTTGGG 
ACTTTGCTGC 
ACCCTTATGT 



ATGCAGCAGT 
ACCAGTAAGC 
TTTGATGATG 
TTGGTCCAAG 
TTTCCCAAGG 
CAAATCTGGG 
AAAAAAGTTG 
ATTGTACCAA 
TTTGATTCTA 
CTGCAAGCAG 
GGTGGTCTGT 
CTAGGTTCTC 
GTTTTAGTAG 
TCCAGAGAAG 
TGCAATCGAG 
TCACACTTTT 
GAGCCATATT 
TGCCAATATT 
CTTTGACGAT 
TCACTTGACG 
TGTGGAGAAG 
TAAGATGCGA 
AAACAGCTTT 
AAATCCTTCC 
CTATCTGGCT 
CTTTGATTAT 
CCTCATCATG 
AAATGGTCTT 
TCCTGAAGTG 
GCAAGAATAT 
TAACAGTGGC 
ACTCATGAG 



ATTTTGTAAA 
ATATGTTTCA 
GCATCAAGCG 
AATTAAATGA 
GAGACAAGCT 
CCTTTCCTGC 
AAAAACTAGA 
GTATTAAACT 
TCATAGTAGC 
TCACTGGTCT 
CACCTGCAGA 
GTATTTTGCG 
AATTAGAGAA 
GAACTCTGGG 
AAAGCTCGTG 
GAGGATTTCG 
GGCAACTATG 
GCCCAAGCTG 
GCCTTGGTGG 
GCAGCTATTC 
GCCTTGACCA 
GCTGGGGATT 
GTCACCTATG 
GTTCCTTCTC 
CAGGTCAAGT 
CTTCTAGCTG 
AAGGACTTTG 
GAAAAGATGA 
ACTATTGACG 
TGTCAGGAAG 
AAGGATTCCG 



AGGCAGTGCT 
GGTTATGCGC 
CTTGGCGCGC 
CAATGTGGAA 
GGAGTTCATC 
AGACTGGTCA 
AAAAATTGCC 
TTTCGAGAAA 
CTATGAAGAA 
TGAAACAGGA 
AATCGAAAGT 
AGCAGAAACA 
ATAAGAGGAA 
ACTTAACAGC 
AAGACATCAA 
AGAAGGCCTT 
CTTTTATGCC 
GGATGGAATT 
TAGCAGACGA 
GTCAGGCCAA 
ATCTCGGTGA 
TTGAAATGGC 
AGAATTCTAC 
AGAGGGACTT 
CTGAAAAACT 
AACAAGAAGT 
CACCAGTCGC 
CCTTTGCAGA 
ATGCCTATGA 
TTGCTCGTTA 
GTGGTTATGC 



ATCTCTCCTG 
TTGAAAGAAG 
GTGCTGGATA 
CTACCAGTCC 
GCTCAAAAAG 
GTTGCCAAGT 
CTTGGAGCAG 
AAAGCAGATT 
TCAGCTAAAG 
GCCAAACTGC 
TTTGAAGCTA 
GCACCGCTTT 
GAAAATGGAA 
CCTATACCAA 
CCAGTTTAGC 
TGCGGAATTG 
TCAGACGACG 
TGAAACAGAT 
GGAAGTCTTG 
AATCAAAAAA 
AGTTTTCTAT 
TGACTTTGAA 
CAAAATCACG 
CGGTAAGCAC 
CTTGGCTGAT 
GGACCGTGTC 
TCAGAGATAC 
CTGGAAATTG 
TTTGGTCATG 
CCAAGAAGAG 
GGCGGACCCA 



TCACTATCGA 
AGGATGAGGT 
TGGAAAATCG 
AAGTGACCAT 
TAACCGAACT 
GGGATGGCAA 
CCGAGCAAAG 
TTCTAGCTCA 
AAGGAGAAGC 
TCTTTATCTT 
AAGGAGCAGT 
ACGCCTTATC 
CAAAAACACC 
GACCGTGAGG 
CGTGATTACA 
GAACAGATCT 
GACTATAGCA 
GCCAGCGTAG 
GACCGTTTGG 
GCCCACTATC 
AGTCCGCAGG 
GCCCATGGCA 
AGGATGCTGA 
CAAAATACGG 
ATGAAGGGAT 
ATGTTTGACC 
CTCAAGCATG 
GACTTGGACA 
AAGTCGGTAG 
CGCTGGGTGG 
TATCGCGTAC 



360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2179 



(2) INFORMATION FOR SEQ ID NO: 38: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2575 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: Genomic DNA 
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(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 38: 

CTTAAGGTTT TCTTCNATAC TTTGGTTTTC AATTAACCCA NAGTTCTGGA AGAGGTAGCC 60 

CAATTCNTGA CGGAGAAAAT CNCTCGAGTT ATNATTGGCC AAGTCTTTAC CTCGGTAANA 120 

AATCGTCCCA GCATAAGGTT CTAATTTCCC AATCATGTTC ATCAAGGTGT TTTCCGCTAT 180 

CACTTGAACC AATTAAGGCA TAGATTTTCC AGCCTCAAAT GTCATTGAAA GATTCGAAAA 240 

TAACTCTCGT TCTCCAAAAG ATTTACTCAC CTGTTTAAGT TCAATCATAT TAACCTCCCT 300 

TCAAAACAAG AATGGACATC TTGTTTTCTT TCTGCATTTG GACATGTAAC TGTAAAAGAG 360 

ATAGACCAGT AAAGAGTAAC AAGACTAAGA AAGCAACTCC TATCTCTACC TGAAGAAATA 420 

CACTCGCAAC AAATCCCAGT AAAAACACAC CCAGTTGAGC AAAGAGATAA GTGCGATGGA 480 

TTTCTAAGAA CCTGAGACCT GCAATGCGTT TGATAAAAAT GGCACGTCTA AATTCTTCAA 540 

AGTAGAGCCT ATTCATAGTG TTAAACAACA AGATTGAAGT TGCAATCCCA AGCACAGCTC 600 

CTGCTAGCAT TACCCAACGT TCCCTCTGGA TATTATCCAA TAATGTGATG TAGTTGTGGT 660 

AACCTGTTTG CATTTCTGAG ACCCAATTTT CAATGCCTTG TCTCTGGATA AGCTCCTGGG 720 

CATCAGACAA TTGATTAAAG AGAACGTAGT TCTGTACTGC GTCTATCCAA AACAAAATGG 780 

ACTGTGGACC AGTTGATTGG GGTGTTATAA CAATGATGAT TGGATCTTTC AAAAACTGCT 840 

GGTAAGAAAT AGGGGTCGTA TTATACACAA AACGATCCTG ACCTGATTCT AAATAACCTA 900 

CCGTAGCAGT CATTTGCTGT CGTTCATCTT GACTAGACAT GCGACTGGTT AAGTCGTCTT 960 

CAAAAACAGA TTTATAATGT TCTTCCTCTG AACGGAGGTG TTCAGGCAGC AATAAGACAA 1020 

ACTCCCCTAC ATCAAGGTGA TTCATCTTTT GCTCAATAGT TGTATCTACA GGAATGTTTT 1080 

GACGCTCCAA GTATTGCGGT GTGACGATAA GGACATTGCC ATTCGGGTTG TAATCGTGCC 1140 

ATTCTGTAGA GGTTATCAAG TTTTTGGAGG AAGCCATGCC ATTTTGCAAA GTACGGTCAA 1200 

TTAACTGGTG TCTAGATAAG AAAGCCTTTT GTTCTGAAAC AGCCAGATCC ATCAATTGAT 1260 

ACCAAGTTCT GAGTTTCCTT TGAGCTTGTT CATCAAAACC AGGACTCGTT CCTTCACGGC 1320 

TGATTGATAG GGTAATCAGT TGCCTTTCCT GACTCCAAAC CTCTTGTCCA ATCCGATGCT 1380 

GTTGCCAGGC TTGGGAATAC TTTAAGCTAC TCCCTATTCC CAGCGTTACA ATAATAATCG 1440 

CTAATAGTTG ACCGATTAAA ATCAAACTAA TGATTCCTCT GACAGGCACT TGCCCTTTTA 1500 

TAATCTGCAT CAGGTGTATC TTTTTTATCC CAACTGCGAA GAGTTGAGCG AAAAACAGGG 1560 

AGATCGACAA CAAGATGAGA TTATAACTGA GCAAGCCTGC TCCTATGGTC ATTAGGGATT 1620 

GCGTTGGAAG CGACAGAATT TTTTGCATAA GAATGGCGAG CACGCCAGCT AAACTGAAAC 1680 

CAACAGCTAT CCCTTTCAAA TCCTCACCAA CTGGTCTAAG GAAAATGGAC CACCGTTTCT 1740 

CTCCTGAAAT GAGGCGAATG CCCGACGACC GTAATTGGCT AATTTGACTA ATTAAAGTCA 1800 

GTGCTACAAA GGTJAAGATG AAAATCAATA GACTGATTAA CTGAAATCCG TTACTAAAGA 1860 

TAGCCATTAA CGTAGATAGT TTAGATGGAA TTGTCAGGTT CATATTAGTC AAGCCAAGTT 1920 

GACTTAGCTC CTCCCTTAGC AAGTGAATAT CTAGGTGTCC ATCGAATACA AAATAGTTTG 1980 

TTTCAACACT ACTACTTTGA GCATCTTCTA GTTTTTTTTC CTGAAGCCCA TCAGGCAACT 2040 

TTCCGTCCCC AAAGGTTGTA TAAGAAAAAA CTGTAGTACC TTCTGAGTTA GGATCTTGGT 2100 

GTTGAATCGC AATAAAGCTA TCTGTTTCTT CTGCTAGTTT TTCCAAGCGT GTAGCAACAT 2160 

GCTGAAAAGT TGTTTCAGGG GAAGAATCAT GTACAACAAC GTTGGGGTAA TAATAGGAGA 2220 

CATATGTATC AGTCCAAATG GTAAATACCC AAACAAAGAA CAAACTAGCA AGCAGGTTGG 2280 

ATATGAGTAT AAATAATTTT TTCATAGTGC ATTCCTTATT CTAAAACAGA GGGCAGAAAT 2340 

ACGCCTACCC TCTATAAACT AACAAGTTTA CTAAACTATT GATAAAAGAA TCTTCTTCAA 2400 

TTATTCTTTA CCATCCCGCA TCAAATGTAA ATTTTAACAT ATTATTTACT CCTCTGAAAT 2460 
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GAAATTTATG AAAATGAAGC GAAAATGGCA ACTACATATT TTAATCAGAA AATCTCAGTC 2520 

TGAATTATAG TATCCTATTT TTCCCCTCCT TTCCTAACCT GGCACGTTCC TTTCT 2575 

(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1692 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 39: 

CTATTCGGAA TAAGAAAAAT GCCAGTATGG TATTGGCAGC CAAGGCTGTC AAAGATGGTG 60 

AAGCAGACGC TGTCCTTTCG GCTGGGAATA CAGGTGCTTT GTTGGCTGCT GGATTCTTCA 120 

TCGTGGGTCG TATCAAGAAT ATCGACCGTC CTGGACTTAT GTCAACATTG CCGACTGTAG 180 

ATGGGAAAGG TTTTGACATG CTTGACCTCG GTGCTAATGC AGAAAATACA GCCCAGCACC 240 

TCCATCAATA CGCTGTCCTA GGTTCTTTCT ATGCCAAGAA TGTTCGTGGC ATTGCGCAAC 300 

CACGTGTTGG TTTGCTCAAC AACGGAACAG AGAGTAGCAA GGGCGACCCG CTTCGTAAGG 360 

AAACTTATGA ATTACTGGCG GCTGATGAAA GTTTGAACTT TATCGGAAAC GTGGAAGCGC 420 

GTGATTTGAT GAATGGCGTT GCAGATGTTG TTGTGGCAGA TGGTTTCACG GGAAACGCTG 480 

TGCTCAAATC CATCGAAGGG ACAGCTATGG GAATCATGGG CTTGCTCAAG ACAGCTATTA 540 

CAGGTGGTGG TCTTCGAGCG AAACTAGGTG CCCTCCTTCT CAAGGACAGC CTCAGAGGTT 600 

TGAAAAAACA GCTCAATCAT TCAGATGTTG GTGGAGCGGT CTTGTCTGGT GTCAAGGCAC 660 

CTGTTGTCAA GACTCATGGC TCAAGCGATG CCAAGGCTGT TTATAGTACG ATTCGCCAGA 720 

TTCGTACCAT GCTAGAAACA GAAGTAGTTG CCCAGACTGC GCGTGAATTT TCAGGAGAAT 780 
AAAAGAGATG AAAGAAAAAG AAATTTTTGA CAGTATTGTG ACCATTATCC AAGAGCGACA 
GGGAGAGGAC TTTGTCGTGA CAGAATCCTT GAGTCTGAAA GACGACTTGG ATGCTGACTC 
AGTTGATTTG ATGGAGTTTA TCTTGACGCT GGAGGATGAA TTTAGTATCG AAATCAGCGA 

TGAGGAAATT GACCAACTCC AAAGTGTAGG AGATGTGGTT AAAATCATTC AAGGAAAATA iv-w 

GCAATCGGAG TTCCAAGTCA ACGGAAGTAG ATGGTTTTTA GAAATGAGAA ATATCGGACA 1080 

AGCTGGTAAA ATCTTGGCTG ACAGTGGTTA TCAAGGGCTC ATGAAGATAT ATCCTCAAGC 1140 

ACAAACTTCC ACGTAAATCC AGCAAACTCA AGCCGCTAAC AGTTGAAGAT AAAGCCTGTA 1200 

ATCATGCGCT ATCTAAGGAG AGAAGCAAGG TTGAGAATAT CTTTGCCAAA GTAAAAACGT 1260 

TTAAAATGTT TTCAACAACC TATCGAAATC ATCGTAAACG CTTCGGATTA CGAATGAATT 1320 

TGATTGCTGG TATTATCAAT CATGAACTAG GATTCTAGTT TTGCAGGAAG TCTAATAGTA 1380 

AAAAAGTGAT TAGAAAACAT CTTTTTTAAA AATAGAGATG ATTTTGAAAC AAAAAAGCTA 1440 

ATTCAAGACG TTTCGATGCC AATTCAAGAT TTGGATGAAA AAAATTAATA GATACTGTTA 1500 

TACTAAACTT GTCAAGTTTG TAACAAGACA AATATTAAAA ATAAAAAAGA GGTATTCGTT 1560 

ATGAATACAA AAAAGATGTC ACAATTTGAA ATTATGGATA CTGAGATGCT TGCTTGAGTT 1620 

GAAGGTGGCG GATGCAATTG GGGAGATTTT GCCAAAGCAG GTGTTGGAGG AGGAGTAGCA 1680 
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CGAGGTCTTC AG 1692 
(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1117 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 

CCACGTTGGG GTCTTAGTAC CTGCTGATGA CGGTTTCTTA TTTGTAGAGA AATTGACTTT 60 

CGAAGAGCCC TACCAAGCGA TTAAATTTGC TAGTAAGGAA GATTGCTACA AGTATTTGGG 120 

CACCAAGTAT GCGGATTATA CAGGCGAGGG ACTGGCTAAG CCTTTTATCA TGGATAATGA 180 

TAAGTGGGTT AAACTTTAAT AGTGTTATGA TATGACAGTA ATCTTAGGAT TTTTATTTCT 240 

ATTTTTGATA AGTTGGTGGG TTATTTCAAT CGTTAATAGC CAGAAACCAC GCAAAAATGA 300 

AACATTTATT GGCTATGTCC AAAGATACGA TATTGATGGG AATGCCGTGA TCAATCAATA 360 

ACCGTGTCCT TCAGAACTGA GAGGAGTTCT TGAAATCGTA AAACCACTTT GAACAAGAGT 420 

AACTTCAACC CATCGGCTCC GACGGATTAA GTGCTTTCGT GAATACCAAA ATCAGCCGCA 480 

ATTTCTTCAT AAGTGCGATA TTCTCGCACA TATTGAAGAG TGGCCATAAG AAGGTCTTCT 540 

AGGCTTAATT TAGGTTTTCG TCCACCTTTT GCGTGTTTAA GTTGATAAGC TGTTTTTAAT 600 

ACAGCTAACA TCTCTTCAAA AGTCGTGCGT TGAACACCAA CAAGACGCTT AAATCGTGCA 660 

TCGGTTAGTT GTTTACTTGC TTCATCATTC ATAGAACTAC TATACCATAT TTTATTTCGC 720 

AGGAAGTCTG ATCTCATCCG GTTTTGCTTG ACGGGCTTTT GCGGTTATTT CTTCAAAAGG 780 

GACCCACCAG ACTGGCCCAC AGCCCTTGAG ACCGTGGCCA TTCATATCAC CTCTTCGTCT 840 

AGGGAAAAGG TGCCAGTGGA GATGGGCATC GCCATTTCCT AGAAGTTCAA TATTCATTTT 900 

CTCAGCAGTA AAGGCCTTGG CAACTGCCTC TTGGACTAAA CTCATTTCTT CTAGAAAATG 960 

GAGTCTTGTG TCCTTTTCTA AAATGGTGCA ATTCCGCGAC ATGTTCCTTG GCTAAAAAAA 1020 

AACTATAGCT TGCAAGCTCA ATACTGATGG TCTCCAATCA CAAGATAGCC TGTTTCCAAC 1080 

TCTTTGACAA AGTAGGGATT TTCTCCCTTC TTGATGA 1 117 

(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1385 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(DJ TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 

CTACAGCGTA GAGTTCAGGG TATTCAATTG AAATGCGTTT AATCGTACGT GTTGTAGCAT 60 

GTTTTCTGAC AAAAAACGGG ATTCGCTTAA TCAAGTCTTG TGGGACTAGC GCCAGAATCT 120 

TCTCAGTAGT TTCTTTGTCA CTAATATNAG ACATTGCAAG CCTTTTCTTA ATCATTTCCT 180 

GTTCTTTTTC TGTAAAATCT TTTAATTCCA TTCGATTAGT CCTCCTATTT TCTCTAAGTT 240 

AAATTATGTA CTAATACNGA TGAAACTACA AAGAATAAAC TTTAAGAAAT CTTCTCACTG 300 

ATCAAGATTT CTTACTTTCC AATAGACTTC CTGCGAAACA AAATATGGTA TAGTAGTTCT 360 

ATGAATGATG AAGCAAGTAA ACAACTAACT GATGCACGAT TTAAACGTCT TGTTGGTGTT 420 

CAGCGCACGA CTTTTGAAGA GATATTAGCT GTATTAAAAA CAGCTTATCA ACTTAAACAC 480 

GCAAAAGGTG GACGAAAACC TAAATTAAGC CTAGAAGACC TTCTTATGGC CACTCTTCAA 540 

TATGTGCGAG AATATCGAAC TTATGAACAA ATTGCGGCTG ATTTTGGTAT CCACGAAAGC 600 

AACTTAATCC GTCGGAGCCA ATGGGTTGAA GTAACTCTTG TTCAAAGTGG TGTTACGATT 660 

TCAAGAACTC CTCTCAGTTC TGAGGACACG GTAATGATTG ATGCGACGGA AGTACAAATC 720 

AATCGCCCTA AAAAAAGAAT TAGCGAATCA TTCTGGTAAA AAGAAATTTC ACGCTATOAA 780 

GGCTCAAGCG ATTGTCACAA GTCAAGGGAG AATTGTTTCT TTGGATATCG CTGTGAACTA 840 

TAGCATGATA TGAAGTTGTT CAAAATGAGT TGCAGAAATA TCGGACAAGC TGGTAAAATC 900 

TTGGCTGACA GTGATTATCA AGGGCTCATG AAGATATATC CTCAAGCACA AACTCCACGT 960 

AAATCCAGCA AACTCAAGCC GCTAACAGCT GAAGATAAAG CCTATAACCA TGCACTATCT 1020 

AAGGAGAGAA GCAAGGTTGA GAACATCTTT GCCAAAGTAA AAACGTTTAA AATGTTTTCA 1080 

ACAACCTATC GAAATCATCG TAAACGCTTC GGATTACGAA TGAATTTGAT TGCTGGCATT 1140 

ATCAATCATG AACTAGGATT CTAGTTTTGC AGGAAGTCTA TTAAAAATTG AGAAATTAAA 1200 

AGAGATGATA TTAGGGAGAA CTGATTTAGG CAAAGTGTCT CCCCTAGTTA CGACATAAAG 1260 

AAGCATAAAT CGATAATATT TATGTTTCTT TACTAGTGAA ACGCCTAGCC AAATTACCAT 1320 

AGTTACGACG TCATGGATCC TCTAAATCGA TTATATTTAG GGGTCATGAC TAGTGAAGCA 1380 

GTTAG 1385 

(2) INFORMATION FOR SEQ ID NO: 42: 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 690 base pairs 

(B) TYPE: nucleic acid- 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:42: 



CTGAGAAAAG TGGTGGTTGT TCGATTAGAA AAACCAGTAA CTTTTCACAA TATGATAGCT 60 

CCTGATAAGG AAGTAGAAGT ATCCTTGCTC TTCTTTATCA TCAATAACTC AAGTTCAAGT 120 

CAAACAAATA TTCTGGCTCA GTTGATGGAC TTTTTCACAG GAAATGGACA TCTTGAAGAC 180 
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CTATCAAAAA TTTCCGAACC AGAAAAACTT TATGCTTACA TTGCTGAAGC AACCGCTTAA 240 

TCTTGTCTAT TAAAAAATAA AATCGGAGGA AATCTAAATG ATTAAAATTC TTGCTGCCTG 300 

CGGTGCAGGT GTTAACTCAA GTCACCAAAT TAAAAGTGCT CTAGAAGAAG AACTTTCAAA 360 

CCG TGGTTAT GATGTTCACT GTGATGCAGT CATGGTGAAA GATGTAAACG AAGACCTTAT 420 

GAAAGGTTAC GATATCTTTA CACCAATCGC TGCAACAGAT CTTGGTTTTG AACCAGGTAT 480 

CCCAGTTATC GAAGCTGGGC CAATCTTATT CCGTATACCA GCAATGAGCG CTCCAGTATT 540 

TGACAATATT AGACTCCCTG CGAAACAAAA TATGGTATAG TAGTTCTATG AATGATGAAG 600 

CAAGTAAACA ACTAACTGAT GCACGATTTA AGCGTCTTGT TGGTGTTCAG CGTACCACTT 660 

TTGAAGAGAT GTTAGCTGTA TTAAAAACAG 690 

(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 874 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 

CTTCATCCTG CCATACGGTC GGTAATTGAC CTGCTGACAT CCGAAACCTC CCTTAAATCG 60 

CATTCTTGTC AAAACCGAGT TTGCGTTGAA TAAACTTAGC GATTTCGACG ATGATAATCA 120 

TTGAGAAGCT TCCAGCCATA ACAATTCCCC ATTGTGACAA GTCTAGTTTG GTTACGTGGA 1B0 

AAATTCCTTC AAGCGGTTCT ACAACGATTG TCGCCATGAG AAGGATAAAG GATACCACCA 240 

TGGACCAGTT AAAGGTCTTA TACTTGAATG GGCTAGTGAT GGACTTAGAA TAGGTGATTT 300 

GGAGCGTCCT ATTGGCTAGG AAATGCTGAT CATAGTCCTT TGCTGAGGCT AGGGTGTTTC 360 

AACATTCAAC ACTCAACTGG TTGATCTAGT TGATAGGAAG GGAATTACTA TAAAATACTC 420 

AGGCTTCCAT CATATTTTTT GAAACGATTG TGTAATCAAA ATGTACCAAT ATTGTAGTAT 480 

TAGTACAGAA GATGTTGTGA ATGGATAAAT ATATCATAAC TGCTGTCTCA AAAAGATTTC 540 

ATATGTCTGT GCATATATAA TAGACTTCCT GCAAAACTAG AATCCTAGTT CATGATTGAT 600 

AATACCAGCA ATC^AATTCA TTCGTAATCC AAAGCGTTTA CGATGATTTC GATAGGTTGT 660 

TGAAAACATT TTAAACGTTT TTACTTTGGC AAAGATGTTC TCAACCTTGC TTCTTTCCTT 720 

GGATAGCGCA TGGTTACAGG CTTTATCTTC AGCTGTTAGC GGCTTGAGTT TGCTGGATTT 780 

ACGTGGAGTT TGTGCTTGAG GATATATCTT CATGGGCCCT TGATAATCAC TGTCAGCCAA 840 
GATTTTACCA GATCTNNTTT TTTAAAAGGG GGCN 



(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1461 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 



CTTCATCCTG CCATACGGTC GGTAATTGAT ATGCTGACAT CCGAAACCTC CCTTAAATCG 
CATTCTTGTC AAAACCGAGT TTGCGTTGAA TAAACTTAAC GATTTCGACG ATGATAATCA 
TTGAGAAGCT TCCAGCCATA ACAATTCCCC ATTGTGACAA GTCTAGTTTG GTTACGTGGA 
AGATTCCTTC AAGCGGTTCT ACAACGATTG TTGCCATGAG AAGGATAAAG GATACCAAGA 
TGGACCAGTT AAAGGTCTTA GACTTGAATG GGCCAACTGT CAAGATGGAT TGGTAGACAG 
ACTTGACATT GTAGGCATGG AAGAGCTGAA TCAAACCAAG GGTTGCAAAG GCCATCGTTA 
GGGCATCTGC ATGAATAGCA TGATTGTCAC CCACATGAAC TGGGTAAGCA ATCGCAAGGC 
CATAAACACT CATAACAAGA GCTGCTTGGA GTACACCTTG ATAAATGATA GAACTCAAAA 
CACCACCTGA GAAGAAGCTT GCCTTGCGTC CACGTGGTTT ATGATTCATG ACACCAGGTT 
CTGCAGGTTC AACACCACAG AACGATAGCT GGGAAGGTAT CCGTTACCAA GTTGATCCAC 
AAAAGATGAA CCGGCTGTAA GACATCCCAA CCAAACAAGG TTGATAGGAA GATGGTTAAT 
ACTTCAGCAG TATTAGCAGA AAGTAGGTAC TGAATAGTCT TTTGAATGTT TGAGAAGACC 
TTACGTCCTT CTTCCACTGC GACGATAATA GTCGCAAAGT TATCATCTGC AAGAATCATA 
TCAGAAGCCC CCTTAGAAAC CTCTGTACCA GTGATTCCCA TACCGATACC GATATCGGCT 
GTTTTCAGAG CTGGCGCGTC ATTGACACCG TCACCTGTCA TGGCAACGAC TTTACCTTGT 
TTTTGCCAAG CCTTGACGAT ACGAACCTTG TGTTCTGGAG ACACACGGGC ATAAACAGAG 
TATTGACCAA CGACTTTTTC AAATTCTTCA TCTGACAGTT CATTGAGTTC AGCACCAGTT 
AAAACGTGAC CTTCTGTATC GTTTGCGTCA ATGATTCCCA AACGTTTGGC AATGGCTTCC 
GCTGTGTCTT GGTGGTCACC TGTAATCATA ATTGGACGGA TTCCCGCTTC CTTAGCCACA 
CGAACAGCCT CAGCGGCTTC AGGACGTTCA GGGTCAATCA TCCCAATCAA ACCAGTAAAA 
ATTAAATCAT TTTCAAGCTC TTCAGAAGTG AGATTTTCTG GAATACTATC GATAATCTTA 
TAAGCACCTG CAAGGACACG CAAGGCTTGA TGAGCCATTT CAGAATTGTT TGTATGAATG 
AGATTTGTAA CCTTCTCATC AATCGGAGCA ATATCCCCAG CCTTATCACG AAAGAAGACA 
ACGTTTTAAG AGTTGGTCTG GCGCACCCTT GACTGCTACA AGGAAACGAC CATCTGGCAA 
TGGGTGAACT GTTGACATGA G 

(2) INFORMATION FOR SEQ ID NO: 45: 

{i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2445 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(DJ TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1461 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 

CTCCCATCAA GCTTGCTTCA AGTTTTTCTG TCAAGGCTTG CGCTTCTTCA TCCTTGACAT 60 

TAACAGTAAT CAAATGCACA TCAAATGGTG CCAATTCTTT AGGGAAATTG ATTCCCCAAG 120 

CGTAACGGTA TTCACCTTTT GGCGTTTTGT TAAACAAAGA GGCGAGCGTG TTGCTCCATC 180 

ACTGCTGAAA GAAGACGGCT GACACCGATA CCGTAACATC CCATGATGAT TGGCACAGCA 240 

CGACCATTTT CATCCAAGAC ATCTGCTCCC ATGCTTGCTG AATAGCGAGT TCCGAGTTTG 300 

AAAATATGAC CGATCTCAAT ACCACGCGCA AAGTTAAGGA CACCTTGTCC ATCTGGGGAA 360 

ATTTCACCCT CACGAACTTC ACGGATATCC ACATATTCTG CAGTAAAATC ACGGCCTGGG 420 

TTCACACCAG TCAAGTGGTA GCCATCTTCG TTAGCACCGA CAACTGCATT GCGAACATCT 480 

TGCACCTTAC GATCTGCAAT AATTTTAATA TTCTCTGGCA AACCAACTGG TCCAAGTGAA 540 

CCAAATCCTG CTTGAACAAC ATTCGCCACT TCTTCTTCGC TAGCAACGTC AAAGAAATTT 600 

GCTCCCAAGT GATTTTTCAA CTTGACTTCG TTGAGTTGGT CATTTCCAAC TAGAAGGGCT 660 

GCAACAAGCT CACCATCTGC AATGTAGAAG AGGGTTTTAA TCGTTTGTTC TTCTGGAACA 720 

TTGAGGAAGG CTGCAACTTC ATCAATTGAT TTAACATCTG GCGTTTCAAC ACGAGTAACT 780 

TCTTCTTCAG CGACAACACG GTTGCTTGGT TTGTACTCGT TTGTTGCCAT TTCTAAGTTA 840 

GCTGCATAGC TAGACTCACT TGAGTAAGCA ATGGTATCTT CACCAGAGAC TATCCATTTG 900 

AGCAATTCTG CCTTGATTTC TTCTTGCACT TCTGCAGGAA TTTCGTCAAA TGAGGCAACT 960 

GACTTGTCCA AGACAACCCA GCGGTCAAGG TCTGTACGAG CAGATGTAAT GGCCATAAAT 1020 

TCTTGGCTAT CCTTACCACC CATGGCTCCA CCGTCACCAA TAATAGCCTT GAAGTCTAAA 1080 

CCACTACGAG TGAAAATACG CTCATAGGCT GCTTTGTACT CATCATAAAC ACTATCCAAA 1140 

CTATCATAGT TAGCGTGGAA ACTATAAGCA TCCTTCATGA TAAACTCACG TGTACGAAGA 1200 

AGTCCATTAC GTGGGCGTTT TTCATCACGA TACTTGGGCT GAATTTGATA AAGGTTGAGT 1260 

GGCAATTGCT TGTAAGATTT AACAGAATCA CGGACAATAG CTGTAAAGGT TTCTTCGTGA 1320 

GTTGGACCTA AGATAAAGTC TGATTTTTCA CGGTTTTTCA GTTTGTAAAG GTCTTCACCA 1380 

TAGGTTTCGT AACGACCTGA TTCACGCCAC AATTCTGCAC TAAGAAGGGC TGGAGCCAAC 1440 

ATCTCAACAG CACCAATCTT TTCGAATTCT TGGCGCATGA TGTTTTTAGC TTTTTCAATC 1500 

ACACGGTTGG CAAGTGGTAG ATAAGAATAA ACACCTGCTG AAACTTGGCG AACATAACCA 1560 

GCACGCAACA TAAGAGCATG GCTGATAACT TGAGCATCGC TTGGCATTTC GCGAAGCGTT 1620 

GGGATAGGCA TTTTACTTTG TTTCATAATA TTCCTCGATT ATCTAAAAAA GAGTCGCATA 1680 

ATGTCATTCC AAGTCACAGC AATCATCAAG ACAACCATGA TGACCACTCC GGCCAAGGTG 1740 

ACATAGGTTT CAATTTCTTG TTTCAATGGT TTGCGGCGGA TGGCTTCTAG GATATTGAGC 1800 

ACAATCTTAC CACpATCCAA GGCTGGAATC GGAATAAGAT TAAAAATCCC AATATTGATG I860 

GAAATCATTG CCAAGAAGTA CAAGAATATT CTCAATTCCA TTTTTAGCAG CATCACTACT 1920 

TGCCTTAAAG ATAGCAACAG GTCCACCCAA CTTGTTCAAA TCCGGTTGGA AAATCAGATT 1980 

TTTCAGAGAC TGAAAAAATT CCGAAGAGCT GAGTTCAGCA GCAGTTGTAA AACCACCTAC 2040 

AAACATGGAT AGAAAATCTG ACTTAACCCC CGGTTGAACA CCTAGAAGGT AACGACCTTG 2100 

ACTATCTTCG GGTGTAACAG TGACTTGTTT GTCACTCCCC TTTTCAGAAA TAGTCACATC 2160 

CAAAGTCGGT GCCGTCTTAT CTTTGGTTTC TGTTTCCACA GCTTGGATCA AGCTTTCCCA 2220 

GTCGCCAACC TCATGTGAGC CAACCTTGGC AATTTGTGCC GTTTCTGGTA CTCCTACCTT 2280 

GGGCCAAGGG CACCCTTGGG GGGCATGGAT TATGGAACTT GGATTGGTAT CTAACCATCT 2340 

CTGACACCAC CCTGCATAAA GATTAAAACC CAAAAAACAA CAACACCTAA GATAAAATTG 2400 

TTCATAGGAC CTGCAAAATT GGTAATCAGT TTGCCCCAGA TAAAG 2445 
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(2) INFORMATION FOR SEQ ID NO: 46: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2207 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:46: 



NCAGGCTANC TCTCGTCTGA ATGAAGATGG AACCTTTGCT GACAAGATTG TCATGGGACG 60 

TCACCAAGGG GTCAACCAAG AGTATCCAGC TAATATTGTT GACTACATGG ACGTTTCACC 120 

AAAACAGGTA GTTGCCGTTG CGACAGCATG TATTCCTTTC TTGGAAAACG ATGACTCCAA 180 

CCGTGCCCTC ATGGGAGCCA ATATGCAACG TCAGGCTGTG CCATTGATTA ATCCTCAGGC 240 

ACCTTACGTT GGTACTGGTA TGGAATACCA AGCAGCCCAC GATTCTGGTG CGGCTGTGAT 300 

TGCTCAGTAT GATGGTAAAG TTACTTACGC AGATGCTGAC AAGGTAGAAG TTCGTCGTGA 360 

AGATGGTTCA TTGGATGTTT ACCACATCCA AAAATTCCGT CGTTCAAACT CAGGTACTGC 420 

TTACAACCAA CGCACTCTCG TAAAAGTTGG TGATGTCGTT GAAAAAGGCG ATTTCATCGC 480 

TGACGGACCT TCTATGGAAA ATGGAGAAAT GGCGCTTGGA CAAAACCCAA TCGTTGCCTA 540 

CATGACTTGG GAAGGTTACA ACTTCGAGGA TGCCGTTATC ATGAGCGAAC GCTTGGTGAA 600 

GGACGATGTC TACACATCTG TTCACCTTGA AGAATACGAA TCAGAAACGC GCGATACAAA 660 

GCTTGGGCCT GAAGAAATCA CTCGATCGAA ATTCCAAACG TTGGTGAAGA TGCCCTCAAA 720 

GACCTTGACG AAATGGGGAT TATCCGTATT GGTGCTGAGG TTAAAGAAGG TGATATTCTT 780 

GTAGGTAAAG TAACACCTAA GGGTGAGAAA GATCTTTCAG CTGGAAGAAC GTCTCTTGCA 840 

CGCTATCTTT GGAGACAAGT CTCGTGAAGT GCGTGATACT TCTCTTCGTG TACCACACGG 900 

TGCCGATGGT GTCGTTCGTG ATGTTAAGAT CTTTACACGT GTAAATGGAG ATGAGTTGCA 960 

ATCAGGTGTT AACATGTTGG TTCGTGTTTA CATCGCTCAA AAACGTAAGA TTAAGGTCGG 1020 

AGATAAAATG GCCGGACGTC ACGGAAACAA AGGGGTTGTC TCTCGTATCG TTCCTGTAGA 1080 

AGACATGCCT TACCTTCCAG ACGGAACTCC AGTCGACATC ATGTTGAACC CACTTGGGGT 1140 

GCCATCACGT ATGAATATCG GTCAGGTTAT GGAGCCTCAC CTTGGTATGG CAGCTCGTAC 1200 

TCTTGGTATT CACATTGCGA CACCAGTCTT TGATGGAGCA AGTCCTGAAG ATCTTTGGTC 1260 

AACTGTTAAA GAAGCAGGTA TGGATAGCGA TGCCAAGACA ATCCTTTACG ATGGACGTAC 1320 

AGGTGAACCA TTTGATAACC GTGTTTCTGT TGGAGTCATG TACATGATCA AACTCCACCA 1380 

CATGGTTGAC NATAAATTGC ACGCGCGTTC AGTCGGACCT TATTCAACTG TTACCCAACA 1440 

ACCACTCGGA GGTAAAGCTC AGTTTGGTGG ACAACGTTTC GGTGAGATGG AGGTTTGGGC 1500 

TCTTGAAGCC TACGGTGCGT CAAATGTCCT TCAAGAAATC TTGACTTACA AGTCTGACNA 1560 

TATCAACGGG ACGTTTGAAA GCCTATGAAG CTATTACAAA AGGCAAACCA ATTCCAAAAC 1620 

CAGGTGTTCC AGAATCCTTC CGAGTTCTTG TCAAAGAATT GCAATCTCTT GGTCTTGACA 1680 

TGCGTGTCCT AGACGAAGAT GACCAAGAAG TGGAACTTCG CGACTTGGAT GAAGGAATGG 1740 

ACGAAGATGT CATCCACGTA GATGACCTTG AAAAAGCCCG CGAAAAAGCA GCCCAAGAGG 1800 
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CTAAAGCAGC CTTTGAAGCT GAAGAAGCTG AGAAAGCAAC AAAAGCGGAA GCAACAGAAG 
AAGCTGCTGA ACAAGAATAA GCAGTTCACT TAGAATAGAA AGGGAAGAAA TAGTGGTTGA 
TGTAAATCGT TTTAAAAGTA TGCAAATCAC CCTAGCTTCT CCAAGTAAAG TCCGTTCATG 
GTCTTATGGA GAAGTCAAAA AACCTGAAAC AATCAATTAC CGTACCTTGA AACCAGAACG 
TGAAGGACTC TTTGATGAAG TGATCTTTGG TCCTACAAAA GACTGGGAAT GTGCTTGTGG 
TAAGTACAAA CGCATTCGTT ACAGAGGAAT TGTTTGTGAC CGCTGTGGGG TTGAAGTAAC 
GCGTACGAAA GTTCGTCGTG AGCGTATGGG ACATATCGAA TTGAAAG 

(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2339 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 



CGACTGAAAT 


GGGTGAATTG 


TACTCTTGGC TTGGTTTGTC 


AGTAGGGATT 


AACTTGGCTA 


60 


CCAAATCTCC 


AATGGAGAAA 


AAAGAAGCCT ATGAGTGTGA 


TATTACTTAC 


TCAACTAACT 


120 


CAGAAATCGG 


ATTTGACTAC 


CTTCGTGACA ATATGGTCGT 


TCGCGCTGAA 


AACATGGTAC 


180 


AACGTCCGCT 


TAACTATGCC 


TTGGTCGATG AGGTTGACTC 


TATCTTGATT 


GACGAGGCTC 


240 


GTACACCTTT 


GATTGTATCA 


GGTGCCAATG CGGTTGAAAC 


CAGTCAGTTG 


TATCACATGG 


300 


CAGACCACTA 


TGTAAAATCT 


TTGAACAAAG ATGACTACAT 


CATCGATGTG 


CAGTCTAAGA 


360 


CTATTGGTTT 


GTCTGATTCA 


GGGATTGACA GGGCTGAAAG 


CTACTTCAAA 


CTTGAAAACC 


420 


TCTATGACAT 


CGAAAACGTG 


GCTTTGACCC ACTTTATCGA 


TAACGCCCTT 


CGTGCCAACT 


480 


ACATCATGCT 


TCTCGATATT 


GACTATGTGG TGAGCGAAGA 


GCAAGAAATC 


TTGATTGTCG 


540 


ACCAATTTAC 


AGGTCGTACC 


ATGGAAGGTC GTCGTTATTC 


TGATGGATTG 


CACCAAGCTA 


600 


TTGAAGCCAA 


AGAAGGTGTG 


CCAATCCAGG ATGAAACCAA 


GACATCTGCC 


TCAATCACGT 


660 


ACCAAAACCT 


TTTCCGTATG 


TACAAAAAAT TGTCTGGTAT 


GACGGGTACA 


GGTAAGACTG 


720 


AGGAAGAAGA 


ATTyCGTGAA 


ATCTACAACA TTCGTGTTAT 


TCCAATCCCA 


ACAAACCGTC 


780 


CTGTTCAACG 


TATTGACCAC 


TCAGACCTTC TTTATGCAAG 


TATCGAATCT 


AAGTTTAAAG 


840 


CGGTTGTCGA 


AGACGTTAAG 


GCTCGTTACC AAAAGGGTCA 


ACCTGTCTTG 


GTTGGTACAG 


900 


TAGCGGTTGA 


AACTAGTGAC 


TACATTTCTA AGAAATTGGT 


TGCAGCTGGT 


GTTCCTCACG 


960 


AAGTCTTGAA 


TGCCAAAAAC 


CACTATAGAG AAGCCCAAAT 


CATCATGAAT 


GCTGGTCAAC 


1020 


GTGGTGCCGT 


TACCATCGCA 


ACCAACATGG CGGGTCGTGG 


TACCGACATC 


AAGCTTGGTG 


1080 


AAGGTGTTCG 


TGAACTTGGA 


GGACTTTGTG TTATTGGTAC 


CAGAACGTCA 


TGAAAGTCGT 


1140 


CGTATCGATA 


ACCAGCTTCG 


TGGACGTTCA GGTCGTCAAG 


GAGATCCAGG 


TGAGTCACAA 


1200 


TTCTACCTAT 


CTCTTGAAGA 


TGATTTGATG AAACGTTTTG 


GTTCTGAACG 


CTTGAAGGGA 


1260 


ATCTTTGAAC 


GCTTGAACAT 


GTCTGAAGAG GCCATTGAGT 


CTCGCATGTT 


GACGCGTCAG 


1320 


GTTGAAGCAG 


CTCAGAAACG 


TGTCGAAGGA AATAACTACG 


ATACCCGTAA 


ACAAGTCCTT 


1380 
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CAATACGATG 
ATCACTGCAG 
CGTGTCGTTG 
TTTGCTAAGT 
TCTGATAAGG 
GTTTCAAAAC 
GTGGTGGATA 
GGACTTCGTG 
ATGTTTAATG 
CAAATTCATG 
AATATCGCTG 
AATGAACTTT 
TAAAATGAGA 
TTTGCTTTAT 
AAGAAGTTCG 
ATCAAGAGCT 



ATGTCATGCG 
ATCGTGACTT 
ATGGTCATGC 
ACAACTTGCT 
CCATCAAGGA 
TACGCGATGA 
ACAAGTGGAC 
GCTATGCTCA 
ATATGATTGG 
AACAAGAAAG 
CTCACCAAGC 
GCCCATGTGG 
TAGTTTAGAG 
AAGGAGATGA 
TGCCTTGTCA 
AGAAGCCATT 



TGAACAACGT 
GGCACCTGAA 
GCGTGCCAAA 
TCCTGAAGAT 
AGAGCTTTTC 
AGAAGCAGTT 
AGATCATATC 
GAACAACCCT 
TTCGATTGAG 
ACCACAGGCA 
AAGTATGCTA 
TTCTGGTAAG 
GCGGATATCT 
GTTATGGTAT 
AAATTAGAAG 
ATACGTGGAG 



GAGATTATCT 
ATTCAGTCTA 
CAAGATGAAA 
TCTATTACGA 
CAACGTGCCT 
AAAGAATTCC 
GATGCCCTTG 
GTTGTTGAGT 
TTTGATGTGA 
GAACGTCATA 
GAAGATTTGG 
AAATTTAAAA 
TGTGAAAAGT 
TTACAGCAAA 
GTCAGGCTTT 
AAGACCAGCG 



ATGCTCAACG 
TGATTAAGCG 
AACTAGAGGC 
TGGAAGACTT 
TGAAGGTTTA 
AAAAAGTTTT 
ATCAATTGCG 
ATCAGGCAGA 
CACGCTTGAT 
TCAGTACAAC 
ATTTGAGCCA 
ACTGTCACGG 
AAATTTTTAC 
AAGCTCTAAA 
GGAGAGGAAA 
AATTCTCTTG 



TTACGATGTC 
CACGATTGAA 
AATTTTGAAC 
GTCAGGCTTG 
CGATAGTCAG 
GATTCTACGA 
TAACGCGGTT 
AGGTTTCCGT 
GATGAAAGCA 
AGCGACTCGC 
GATTGGACGC 
TAAAAGACAA 
TGGGTATCCG 
ATAAATATAG 
TCACAGCGAG 
GTAATCGGG 



(2) INFORMATION FOR SEQ ID NO; 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 642 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2339 



<ii) MOLECULE TYPE: Genomic DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 



TTAATNGGTA 
TTGGTATCGC 
TGTCATTATC 
TGCCTACCGC 
ATGGCTATAT 
TTCCACCAGC 
GTTGGTTGAT 
CTCAAGTTAG 
ATCTTAATTT 
AATGCAGTTA 
CCGGGGGAAT 



CTTTGGCGAG 
TTACTTTATG 
ATCTTTTATT 
TTTCATGGCA 
CTACCAATGG 
AGTATCTAAA 
TGTTTATGCA 
CTTTGGGAGG 
TCCTTAATAG 
TGAAACCGTT 
CCCNCTATTT 



NCCTTTTAGG 
GCTAGACAAT 
ACAGTTACGC 
TCAAAAGGTT 
TTTATAAATC 
AGCTTTAGTG 
ACTTTAGATA 
TCCACTTGGA 
TAGTTTTTGG 
ATGGTTAGCT 
CNTAAACGGG 



NAAAATGGTA 
ACAAAGTTGA 
CTTTCATTAC 
TATTTGTTGC 
ATAATATACA 
CCATTATTCC 
AATTAAGTTT 
CTTTTAGGAA 
TTTGTAGGAT 
AATCTAGATG 
CCCCCCCCCG 



TCNATGGCCN 

TGGTATTCCT 

AGGAGAGGCT 

TATGATTTTA 

GATAAAAATG 

TGGTGCAGTG 

ACCTAATCTT 

ATAATGTTAT 

TACACGGAGG 

CAAATAAAGT 

NG 



TGATCTAGTT 
GCTGGTATTG 
GGAGCAGGAA 
GGATTGATTA 
CCGGATGGTG 
ACTATTGTTG 
CATGAAATTG 
TGGTCTTCTT 
AAATGTTGTT 
ACNCGCCCNC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
642 



(2) INFORMATION FOR SEQ ID NO: 49: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1460 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 

CTTACTTTGC AGAAATTTTC CGTGGGGGAT TGACACTATT CCAAGAGGAC AGTATGAGGC 60 

TGCCAAGGTC TTGAAGTTTA GCCCTTTTGA CAGAGTGCGC TATATTATCT TGCCCCAAGT 120 

GACCAAGATC GTTCTTCCTA GTGTCTTTAA TGAAGTTATG AGTTTGGTCA AGGATACTTC 180 

TTTGGTCTAC GCTCTCGGAA TTTCAGACC7 TATCTTGGCT AGTCNAACAG CTGCTAACCG 240 

CGATGCTAGT CTGGTTCCTA TGTTCTTGGC AGGAGCTATT TACTTGATTT TAATTGGGAT 300 

TGTGACAATT ATTTCCAAAA AAGTTGAGAA AAAGTACAGT TATTATAGAT AGGGGGCAGG 360 

ATTCGATCTA TGTTAGAATT ACGAAATATC AATAAAGTCT TTGGAGACAA ACAAATCCTG 420 

TCTGATTTCA GTCTAAGTAT TCCTGAAAAG CAAATCCTGG CTATCGTTGG ACCTTCTGGT 480 

GGAAGTAAGA CAACTCTTTT ACGTATGCTT GCGGGTCTTG AAACCATTGA TTCAGGGCAA 540 

ATCTTTTATA ATGGACAACC TTTAGAGCTG GATGAATTGC AGAAGCGCAA CCTACTGGGA 600 

TTTGTCTTCC AAGATTTTCA ACTATTTCCT CATCTATCAG TTCTGGAAAA TTTGACTTTA 660 

TCGCCTGTGA AGACCATGGG AATGAAGCAG GAAGAGGCTG AGAAGAAGGC GAGTGGACTC 720 

TTGGAACAGT TAGGACTAGG AGGACACGCA GAGTCCTATC CTTTCTCACT ATCTGGTGGG 780 

CAAAAGCAGC GGGTGGCTTT GGCGCGTGCT ATGATGATTG ACCCAGAAAT CATTGGCTAC 840 

GATGAACCAA CTTCTGCCCT GGATCCAGAA TTACGTTTGG AAGTGGAGAA GCTAATCTTG 900 

CAAAATAGGG AACTTGGGAT GACCCAGATT GTGGTTACCC ATGATTTGCA GTTTGCTGAA 960 

AATATCGCCA TGTATTATTG AAAGTAGAAC CTAAATAGGA GGAAAAATGG ATGAAAAAAT 1020 

GGATGCTTGT ATTAGTCAGT CTGATGACTG CTTTGTTCTT AGTAGCTTGT GGGAAAAATT 1080 

CTAGCGAAAC TAGTGGAGAT AATTGGTCAA AGTACCAGTC TAACAAGTCT ATTACTATTG 1140 

GATTTGATAG TACTTTTAGT TCCCAATGGG ATTTGCTCAG AAAGATGGTT CTTATGCAGG 1200 

ATTTGATATT GATTTAAAAA CANCTCTTNT. TGNNAAATAC GGAATCACGG TAAATTGGCA 1260 

ACCGATTGAT TGGGATTTGA AAGAAGCTGA ATTGACAAAA GGAACGATTG ATCTGATTTG 1320 

GGATGGCTAT TCQQCTACAG ACGAACNCCG TGANAAGGTG GCTTTCAGTA ACTCATATAT 1380 

GAAGAATGAG CAGGTATTGG TTACGAAGAA ATCATCTGGT ATCACGACTN CAAAGGATAT 1440 

GACTGGAAAG ACATNAGGAG 1460 

(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1323 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 



CTGCTATCTT CATCGTTTCA ATCAATACCG GTGCCTACAT GACTGAAATC GTCCGTGGTG 60 

GTATCCTGGC AGTTGACAAG GGACAATTTG AAGCTGCGAC TGCTCTTGGT ATGACTCATA 120 

ACCAAACCAT GTGTAAGATT GTCCTACCTC AGGTTGTCCG CAATATCCTA CCAGCAACTG 180 

GTAATGAATT TGTCATCAAT ATCAAAGATA CATCTGTATT GAACGTTATA TCTGTTGTCG 240 

AGCTTTATTT CTCAGGAAAT ACCGTGGCGA CGCAAACCTA TCAATACTTC CAGACATTTA 300 

CAATCATCGC CGTGATTTAC TTTGTCCTCA CCTTCACCGT AACACGTATC CTACGCTTTA 360 

TCGAGCGCAG AATGGACATG GATACCTATA CTACAGGTGC TAACCAAATG CAAACGGAGG 420 

ATTTGAAATA ATGACACAAG CAATCCTTGA AATTAAACAC CTCAAAAAAT CCTATGGACA 480 

AAACGAAGTG CTAAAAGACA TTTCTCTCAC TGTCCACAAG GGAGAAGTTA TCTCTATCAT 540 

CGGAAGCTCT GGAAGTGGGA AATCGACCTT CCTACGCTCC ATTAACCTAC TTGAAACGCC 600 

AACTGATGGA CAAATCCTTT ATCATGGACA AAACGTCCTC GAAAAAGGCT ATGACCTCAC 660 

GCAATACCGT GAAAAGTTAG GGATGGTTTT CCAATCCTTT AACCTCTTTG AAAATCTCAA 720 

CGTTCTTGAA AACACAATCG TCGCTCAGAC AACTGTCCTT AAACGCGAAC GCACAGAAGC 780 

TGAAGAAATT GCCAAAGAAA ACCTGGAAAA GGTCGGCATG GGAGAACGCT ACTGGCAATT 840 

GCCAAACCAA ATAGCAACTA TCAGGTGGTC AAAAACAACG TGTGGCCATC GCTCGTGCCC 900 

TATCAATGAA TCCAGACATC TATGCTCTTT GATGAACCAA CTTCTGCCCT TGACCCTGAG 960 

ATGGTTGGAG AAGTAATTAA CGTTATGAAG GAATTGGCTG AGCAAGGCAT GACCATGATT 1020 

ATCGTAACCC ATGAGATGGG ATTTGCCCGC CAGGTTGCCA ACCGCGTTAT CTTTACTGCA 1080 

GATGGCGAGT TCCTTGAAGA CGGAACACCT GACCAAATCT TTGATAACCC ACAACACCCT 1140 

CGTCTGAAAG AGTTCTTAGA TAAGGTCTTA AACGTCTAAA CTCAAACTGC AAGGATTTCC 1200 

TTGCAGTTTT TCTACCTCGT ATTGGAATTT TTGATTTTTC GGAAAATTAT GTTAGAATTA 1260 

AGTTTATGAA ATGAGGTTTC CTCATACCTA GCAAGACTAG GAATAAAAAT AGAAATTAGG 1320 
TAG 



1323 



(2) INFORMATION FOR SEQ ID NO: 51: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2469 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 



CTCTTCAAAT TCTTCACGAG TACGGAAGAA ATAGTCAACA CCGTCCACTT CTCCAGGACG 
TTGTGCGCGT GTCGTCATCG ATACAGAATA TTGAAATTGG TTTTCAGAAC TCTCAAAAAT 

120 
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CTCTCTTCTA ACCGTTCCTT TTCCAACCCC TGAAGGACCA GAAAAAACGA TTAGTAAGCC 180 

TCGGTCTGCC ATTGTGTCTC CTTTTAGTCA ATCTGTGAAA TAACATTTCT CTAGAATAAT 240 

GGCAAAAAGC CAGATTATCC TTTACAGTCT TTCTATATAG TGTAACAAAA AAGCAGTAAT 300 

TTTTCAACTG CTCTTTCTTA TTTATTTAGC ATAATCTACT GCACGAAGCT CGCGAATCAC 360 

GGTTACCTTG ATATTTCCTG GATAATCGAG ATTGTTTTCA ATTTTCTTAC GAACTTTGTG 420 

AGCCAAGATT GTGACTTTGT CGTCCTTGAT TTTTCCTGGA TTGACCATGA TACGAATTTC 480 

ACGTCCTGCT TGAAGGGCAA AGCTAGTTTG CACTCCTTCA AAGCCGTTAG CAATTTCTTC 540 

CAAATCATGG AGACGCTTGA TGTAGCTTTC AAGAGACTCA CTACGAGCAC CTGGACGGGC 600 
TGCGCTCAAG GCATCTGCTG CAGCGACGAT AACTGCTATC ACGCTCTCAG CTTCAACATC * 660 

TCCGTGGTGA CTAGCAATCG TATTCACCAC AACTGGGGGT TCCTTGTACT TACGGGCCAA 720 

TTCCATACCG ATTTCAACGT GGCTACCTTC AACCTCATGG TCAATGGCTT TCCCGATATC 780 

GTGAAGGAAT CCAGCACGAC GGGCAAGAGC CGCATTTTCA CCAAGTTCGC TCGCCATGAT 840 

ACCAGCCAAC TTAGCAACCT CAATCGAATG GCGCAAAACA TTTTGTCCAT ATGAAGTACG 900 

GAACTGCAAA CGTCCCATAA TCTTCATCAA GTCTGGATGA AGGTTTGGCG CACCAATTTC 960 

ATAGGCAGCA GCCTCACCGT ATTCACGAAT CTTATTGTCA ATCTCTTGAC GGTTTTTCTC 1020 

AACCAACTCT TCGATACGAG CTGGATGTAT ACGACCATCT TTGAGCAACA TTTCCATAGT 1080 

CATACGGGCA ATCTCACGAC GAATCGGATC AAATCCTGAC AAGGTCACCA CTTCTGGTGT 1140 

ATCGTCGATA ATCACATCGA CCCCTGTCAA ACTTTCAAAG GTACGAATGT TACGACCTTC 1200 

ACGACCAATA ATGCGTCCCT TCATAGTATC GTCTGGCAGA TGAACTGTTG AGTTTGTTGA 1260 

CTCCGCTACA TATTCACCAG CGATACGTTG CATAGCTTGA ACCAAGATGT CCTTGGCCAT 1320 

TTTGTCAGAA CGTTCCTTGA CCTCTTGCTC AGCTTCGCGA ATGCGGACTG GCAATCTCCC 1380 

TGGTCAAGGT TTCCTCTGTC GGAGCCAAGA TAATATCTCG TGCTTCTGCC TGAGACAGAG 1440 

CACCAATACG CTCTAGTTCT GCGTCTTTTG GTCTTTCGAC TTCCTCTAAT TGCTCTTCAC 1500 

GCGCATCAAG GTTTTTCGCT CTATCAGAAA ATACTTTGTT CTTTTCTGTT CAAGTGTTTG 1560 

TTCTTTACTC GTCAAATTGT CGTCCTTACG GTCAAGGCTA GTAGCTCTCT CTGTCAAACG 1620 

ACTTTCGATT TGTTTGAGTT CTTGACGTTC TGATTTGAAT TCAGCGTCCA CTTCTTCACG 1680 

GTATTTTCTG GCTTCTTCTT TGGCCTCCAA TAGTGCTTCT TTTTTAAGAG ACTTGCTTTC 1740 

ACGTTTGGCT TCATTAACAA GTAAATCCGC TTCACGCTCA GCTTGTCCAC GTAAATTAGT 1800 

TGCTTCTTGT TCAGCATTTA AAAGCATCAA CTCTGCAGCT TCCTGAGATG ATTTCATCTT 1860 

AGCTGAGATG CTGACATATC CCAATGACTA AACCAATGAT GACGGCAAAA ACAGCAATCG 1920 

CAAGCGACAT GATTTCCATG TTTTTACCTC ATTTTATTGT TATTCCGAAT GACATACATT 1980 

CTTTTACATT CTACCATAAA AAAGTGATTT TCACAAACCT AAAATAGAAT ATGTTTTGGG 2040 

GAAATTGGAG TTTAAATGCC TATAATTTAT TGGCACAATC TCTATTTCTT AGAAATCGTG 2100 

GTTTAGTAAA GATTTGAGAA AAAGCCTACT TTTCAGTAGA CTTCCTAATA ATCGTTAGAA 2160 

TGCAAGAAAT TTGTTTCTAA CAACAGCAAG ATTTCACTCA GGTATTCTCT GTAAATTAGA 2220 

CTAGTGTCCT GCACGAATTT TATCCATTCC TCGCAAAACC TTGTCTGATA ACTCCTTATT 2280 

TTGGGCGATT CTCAGGGTTT CCTGAATACT GTCCCATTCG CTCTTTGAAA GAACTACAAT 2340 

GTCCTCATCT GGATTTTTAT TGACCACCGT CAAAGGCTCA AATTTATCAT TTACCTTCTT 2400 

CATGTAGTCC TTTAAATGAT TTCGGAATGT TGAGTAAAGG ACTGCTTCCA TAACCACACC 2460 

TCGTTTTAG 2469 

(2) INFORMATION FOR SEQ ID NO: 52: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1828 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 



CTATTCATGT 
GAACTATTAG 
TTGCCTCAAA 
ATCGTATTCG 
AAAACCTGAT 
TCTTTCTTCA 
AAGTAAGCAC 
AAAGCCTACT 
ACTTCCTTAG 
AAGAAATTTC 
TACTCCTGGA 
CTTTATCTTC 
CAACTATTCC 
CTACTTGAAG 
TTACCTCTCC 
CAGACTTTGG 
ATACCCAATT 
TTGCCTTGGC 
ATCTGCTCCA 
CAACAGTCTA 
CTTTCCTAAA 
TAGCTTTCAA 
TAGTTCTAGT 
TTGTTTACAA 
CTAGGGATTG 
ATTACAGGGA 
ATTCGCTCAT 
AGCTTAGGAA 
GGTATCATTT 
ATCCTCCTCT 
CGACCTCGAG 



AAGCATTCGT 
CGATAGCGTC 
AATTCAAGTT 
TCTACGAATC 
AAAAGGAGTC 
TCTTTCTTAC 
TTATACATGA 
ACTCTGAGAC 
TTGTAGGAAC 
TACAAATATT 
TTCTCTTGCT 
CAGCTATCGA 
CACTGGTATT 
CAGCTGAAAG 
TAGTTCCAAC 
AACGCCTATG 
TATTAGCGAG 
AATTTTCCTT 
TCCAATTGAG 
CGGAATTATC 
AACATCAGGT 
TCGTATGGGA 
TGTTQCTATT 
ACTTAATTGA 
CCTTCATTTC 
CTGCTTTCAT 
CTGTTGCTAG 
GTAGTCGTCT 
CTGGAGCCAT 
ACAATGTCAA 
GGGNGCCCGG 



CCCGAAGAGT 
TATCTTGGAC 
AGTGAAGAAT 
AATACGCAAA 
AACCATGGAA 
CTATCTTGTC 
AGGATAATTC 
ACTAGTCAAC 
CCTATTAGCT 
GATTATCATT 
GGGACGAAAT 
TATTTATGGA 
TCTATATGTT 
CATGGGGTCC 
CTTACTAGCA 
TTGATTGGCG 
GTTGGAGGAA 
ATCCAAAAAC 
CCTAAAAAAA 
TTTATCTCTG 
ATGGTATTTG 
TCTGCTATTT 
TACGACATTT 
CAGCCTCAGT 
TTCCTTCAAT 
CTTGATTATG 
CTTACAACAA 
CAATACCTTT 
CTTATCTTGG 
AACAAGAACA 
GCCCAATG 



TTATCAAAGA 
TAAATACGGA 
CAACTTTTGA 
AATTAAACAT 
CGTAAAAAAC 
TTTCTCGTTT 
TCACTAGCTA 
AGTTTCAGGG 
TATCTCTTCT 
GCTTCCATGT 
GAGGTCATCA 
TTCAAAGGAA 
GCTGGAACAA 
TTCGGATTTA 
GCTCCTTGCT 
AAGGATATCG 
ATTCTGCTTT 
ACATTGCAAA 
CTACAAAAGG 
TTTTACCTCA 
TTAAAGGTTA 
TCAATACCAT 
ATCTCCTACC 
ATGGTACCTT 
ACTGGTCTAT 
TCTCTATCTG 
ATAGCACCAA 
GCTAAGATTA 
GTCACAATGA 
ATGACTGTAG 



TGAATCTGGA 
GTATTTCATT 
AGAAGAACTA 
CTTTTCTGCA 
TAAATATTTG 
ATCCTATCGT 
ATTTTGTCAC 
TTTCCATTAC 
CTATGTATGA 
CAACTCCTTT 
CTAAATTTTT 
TTATACTTGT 
TGAACAGTAT 
AACCTATCGT 
TGTATTTATG 
GACTTTCCCT 
TGCAATTATG 
CCGCTACAGT 
AAAAATGGCT 
AATCTACTTA 
TTCTCCAAAC 
TCGTATCCCT 
TAGCCGTTAG 
ATATTGTACC 
TTGGAAGTGG 
TCAGAAGATT 
GTATTGAAGA 
CAACTCCAAT 
TTTCAAAACT 
ATCAAGCTTA 



GATATTGAAG 
GAGACAGGTT 
CAAAAAGGCA 
GATGGTTCCC 
GACAGCCTCC 
TACCGTGCTC 
TTTCTTTAGT 
CGCTACTGTC 
CTTCAAGGGG 
CGTAGGAGCC 
GACAAATGCC 
CTTTACACTG 
TGACAATTCT 
AACGGTTGTT 
AGAGCATTCT 
GTCCTGATTT 
GCGATTATCA 
TTCAGCATGA 
GCCATTTATG 
ATTTATACCT 
AGTTACAAGG 
TTGATTGCCT 
AAAACGGAAT 
AGGAACCGTT 
ATTTCTTATG 
ACCGTATACT 
AGCTGCTGAA 
GATGCTATCT 
CTCTACTTCT 
TNGATACGGT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1828 



(2) INFORMATION FOR SEQ ID NO: 53: 
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<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1176 base pairs 

* 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 

CTAAACTTTC GCTCATAGGC ATACAAATTA ATCCTTTGGC ATAAGTAGCC ATAAAATTAA 60 

CATTTTCTGT TGTAGCTGCT TGTGCAGAAC AAATTAAGTC TCCTTCATTT TCTCTATCCT 120 

TGTCGTCTAT AACAAGAACA AGTCGTCCCT TCTGCAATGC TTCTAATGCT TCTTGTATTT 180 

TTCGATATTC CATTGACTGA TTATCCTTTC TGCTAAAATC CATTTTGATA TAATAGTTCC 240 

TTCGATATTT CTGATTTTGG AGAGTTATCC ATCAGTTTTT GCACATATTT ACCTAAGATA 300 

TCATTTTCAA GATTTACTGT ACTCCCGACT TGTTTACTCT TAAGAATGGT TTGTTTCAAG 360 

GTATGAGGGA TAACAGATAC TGAAAAGTTT ACTTTGGAGA CTTTAGCGAC AGTCAGACTA 420 

ATGCCGTCAA TTGTAATAGA TCCTTTTTCA ACTATTAAAT CTAAAATTTC TTTTTGTGTG 480 

TTGATTTGAT ACCATACAGC ATTATCATCT TTTTTTATTG ACGAGATTTT TCCTGTACCA 540 

TCAATGTGTC CTGTAACGAC GTGACCCCCA AGTCGACCGT TGACAGATAA GGCTCTTTCT 600 

AGATTCACCT CACTTCCATG TTTTAATAGA GTAAGAGAGC TGTTCGACTC CATGTTTCAT 660 

TCATTACATC AACTGTAAAG GATTGATGAT TGAAATGAGT AACTGTAAGA CAGATACCAT 720 

TTACTGCTAT ACTATCGCCT AAATGGATAT CCGTTAATAT TTTTGAGGCT TTAATTGATA 780 

GTTTACAATT ACGAGAGTCT TTCTGTATTC TTTCAACTTT TCCGATTTCT TCAATTATTC 840 

CTGTGAACAT GGATAAATCA CTTCACTTTC TATGAGATAG TCATTTCCTA TTTGAGAAAA 900 

AGCATAAGGT TTCAATCTAA TAGCGTCATT TGGCAAAGAA ATGCCTTCAC CTCCGACAGG 960 

AAACTTGGCA CTGCCTCCAA AAATTTTTGG TGCAATATAT ATTTTCAGCT CATCAACAAT 1020 

TTGTTGTTCC AAAGCACTCC AATTCATTAG ACTGCCCCCC TCTAGAACTA GGCTATCAAT 1080 

CTGCATGTTT CCTAGATGTT GCATTAAACT CGATAAGTCT ATATGATTGC CTTTTTTCTT 1140 

TATGGGAAGT ATTTCACAGC CATGATTTTG ATATAG H76 

(2J INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1449 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 
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60 



CTATCGATAA CATGGATGCA GACGTTATTT CCTTTGAAGC TAGCCGTTCA AACCTTGAAA 

TCTTGGACGA ACTCAAAGCG AAAAACTTCC AAACAGAAGT GGGACCTGGG GTTTACGATA 120 

TCCACTCACC TCGTGTACCA AATGAAGGCG AAATCGACAA CACAATCGAA GCCATCCTTG 180 

CTAAAGTGCC AAGCAAGAAA GTTTGGATCA ACCCTGACTG TGGTTTGAAA ACACGTGGTA 240 

TCCCAGAAAC AAAAGAAAGC TTGATCCGCC TTGTTGAAGC AGCTAAAGCT GCGCGTGAGA 300 

AATTGTAAGA TTGGATTTCT CTCCATACAG TGGTTGTTCA AAAAGAAATC AACTAGAAAA 360 

GGTTATTACA TATGTCACGC CAAACACCGT CACTCTCATT TGAAGTGTTC CCTCCAAACC 420 

CAGCCGTGGG TAATGATAAC ATTATTTCAG CCTTGCAGGA TATGCAGGAG TTCGCTCCCC 480 

ATTTTATCAG TGTAACTGCC AGCAATAATA AATTTAATAT CAAGGAAACG ACGGTTCGTT 540 

TGGCTGACTT TATCCAAAAT GATTTGGCGA TTCCGACCAT TGCCCACTTG CCAGCTATTT 600 

ATCTAACTAA GGACAAGGTT GCTGAAACTA TTGCTGACTT GGACAAAGTT GGGGTGCAGA 660 

AAATCTTGGC TCTTCGTGGG GATATTATTC CAGACGTGGA ACCACAAAAG GATTTCCGCT 720 

ATGCAACCGA CTTGATTGAG TTTATCAAGG AACAAACCCC TCACTTTGAT ATTATTGGAG 780 

CTTGCTATCC AGAAGGGCAT CCAGATTCGC CAAATCAGAT TTCAGATATT CAAAATCTTA 840 

AGAAGAAAGT GGATGCAGGC TGTTCGAGCC TCGTAACTCA GCTCTTCTTT GACAATGAGC 900 

GCTTCTATGA TTTTCAGGAC AAGTGTATTT TGGCTGGGAT TGATGTTCCC ATTCATGCAG 960 

GAATTATGCC AATTCTGAAT CGAAATCAGG CTCTCCGACT CTTGAAGACT TGTGAGAATA 1020 

TCCATCTTCC ACGCAAATTT AAAGCCATCT TAGACAAGTA TGAGCATGAC CCTGAGTCGC 1080 

TCAGAGCAGC AGGACTTGCC TATGCAGTGG ACCAAATCGT GGACTTGGTA ACTCAGGATG 1140 

TTGCCGGTGT GCATCTCTAT ACGATGAATA ATGCTGATAC AGCAAAATAC ATCCATCAAG 1200 

CAACCCATGC CTTGTTTAAT CACCAGTCTC TAGGATAATA AAAGCAAACC ATTCTTCTCA 1260 

GGTGAGGGGA ATGGTTCCTT TTTAATGGCA AAGACCTCAC TTTTTAAGAA AAATATGATA 1320 

AAATAGATTC TGTACGTACT TGATACAAAG ATGAAGGTAT TAAATAATAC AATGCATTTA 1380 

AACTGTAAGA TTTAACGACA CGAGTGAGTC AAATCGATGT TATTTGACGA AAGAGTTAGT 1440 
AAAGCATTT 

1449 

(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1343 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 

CTCGGTTTCC TTTTCACGAT TAAGCTCGTT GTAGAGTTTT TTTAGCACAC GGTTCATCTT 60 

GAGATTTTCT TGCTCCACCT CACGGATATT GTCCAAGCGT TTGCGGCTTT CCAGCGTCTG 120 

CTCTTCTAGT TGCTCAATGA TACGATTGAC GTCATTGTCC TGATCGATCT GCTGACTGGC 180 

ATCTCCTACG ATAACTTCAG ATAGGCCTAG ACGTTTGGCA ATTTCAAAGG CATTACTTCG 240 
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GCCAGGAACA CCCTGCATAA AGCGATAGGT 
ACTGGCATTT TGCACAAAGG CTGTCTCAAT 
CGTCGCCATG GTCTTGATTT GACGCAGGCG 
TGCTCCCTCT TGGGGATCAG TACCAGCCCC 
GTTGACCTTG CCAAGAATAT CCACGATATT 
CTGCTCAATA GACTGCTCAT CTCCAATATC 
ACTTCCCTTG TCTGCTAAAA TCGGCAATCC 
AGTTTTGAGC ATGATGGTCT TCCCACCTGT 
AATCTTGACC AAAATAGACA TCATTTGCGA 
GGAGCAGTTG AATCTCTTGA TTTTCTGACA 
TAAATCGAAC CTTGGCACGA ATCAAGTCCA 
TCTCAGCCGC ATGAGGGCGG ACACGCTCAG 
AGCGCTCATC TGCTCGCAGA CTAGCAATTT 
CGATATAGAC GGTGTTTCCA CTAGCAGAAA 
GGTAGGTGTT TTTGACTGGT AAAACCTGAC 
TCAACATCTG CGCTTTTTGC TTGAGCAAGT 
TATCATGTAT TTTTCGACGG ATTCGCGCCA 
CCGCATCATT AAAGGCCTGA AGATTTCCTT 
ACCAAAGGGC TAATTCTTCC AAG 

(2) INFORMATION FOR SEQ 
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CGGGCGAAGA GTTGCAGTAT 




300 


ACCGTAGGCC TTGAGTTCTf; 




360 


AAGGTCCTCC AGAATAGCCA 




A "> A 


CAACTCATCC AAAAGTAAGA 




a on 


GGTCATATGG CTAGAGAAGG 


TAGAPAAOPT 


5 /I rt 


AGCAAAGATT TCTTCAAAAA 


x AULnAWiWo 


c n a 


TGACTGGGCC ATGACCTGTG 


TP A AflPPP Rr» 


£ £ A 
0 DO 


ATTGGGACCT GTAATGAPAA 


TAfTPPTPTTA 


T ~i A 


CGGCATTTTT GACCAAAGGA 


x\*Viv.AViACAT 


780 


GCTGAGGCAC GACTGCTTnT 


CI lTvTTGGA 


840 


GATGACCGAT AATPP A AfW* 


1 LAi "1 AGCAA 


900 


x * * w * x \j vuvvarLri 1 uLua 


AGCATTTCAT 


960 


CTTCGCTCAG TTTGACTACC 




1020 


TATCAT£5AAP rtir , ar , pTv , p k 


ATCTTATTGC 


1080 


GGCCATTTCT GCTAGCAACA 


ATTCCTTCCG 


1140 


CTTGTAAAAC ATCGCGTACC 


TGACTCTCGC 


1200 


ATTCTTCACT GGCAAAATTT 


TCAATGAAAC 


1260 


GTAATTGCGG AAAATCATGT 


AATTTCTCAA 


1320 






1343 



10 NO: 56: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1620 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic UNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:56: 

CCGATCTCTT TTACAGAGGT GTTTCCTATT TTGTGCTTCC ATTTTTGGGG CAAGGAATTG 60 

TAGATGGGGA TGGCAATCCT ATCTTTTTAT TGATTATGAT ATTCGTTTGC TTCATAGTTT 120 

TAGTCTTTTT GAAATGGTTA GACTATGATT TCACTAGATT GAGAAGGGAG TTTCTAGATA 180 

CAGGTTTTCA AAAGTCTCTT ACTAAGATTA ACTGGGCAAT GGGGGCTTAT TATCTAGTGA 240 

TGCAAAGTCT ATCTTACCTT GAATATGAAC AAGGTATTCA ATCAACGACT GTTCGCCATC 300 

TCATCCTAGT GTTTTACCTA CTCTTTTTTA TGGGGGGTAT CAAGAAATTG GATACCTATT 360 

TGAAGGAAAA ACTTCAGGAG GAACTGAACC AAGAGCAGAC CTTGCGCTAC AGAGATATGG 420 

AACGCTATAG TCGGCATATA GAGGAACTTT ACAAGGAAAT TCGGAGTTTT CGCCATGACT 480 

ACACTAACCT CTTAACCAGT TACGTTTGGG CATTGAAGAG GAGGATATGG AGCAGATAAA 540 

AGAGATCTAC GACTCGGTCT TAAGGGATTC CAGTCAGAAA TTGCAGGACA ATAAATATGA 600 

CCTGGGCAGA TTGGTGAATA TTCGTGACCG TGCCCTCAAG AGTCTCCTAG CTGGAAAATT 660 
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(2) INFORMATION FOR SEQ ID NO: 58: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1588 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 

CTGACTATCA AAAAAGACCA TTTTCTTCTT TTTCTCATCG ATAATACGAG AATATTTTCG 60 

GAAATTAGAG GTCGGCTGAA TAATAAAGCC GTCTACTCCC AAGAGAAGCA TGCTTTCAAT 120 

ATACCGGTCC TCACTCTCTT GGCTGTAATT ACTATTTCCT ATCATTACCT GGTAGCCATT 180 

CTGGCTGGCG ATATCCTCAA TTCCCTTAAC AATTTGGTTT GAGAAACTGT TGGTAATATC 240 

ACCGATCAAA ACACCGATTA ATTTTGTTCG TTTGGAGTTT TAAGCTACGC GCAACAATGC 300 

TCGGTTTGTA ATTTGTTTCA TGAATAACTT TTTCAATCTT TTCACGTGTC TCTTGGGACA 360 

TTTTTTCATA TTTCCCGTTT AGGTAAAATG ACACGGTTGT TTTCGAGGTC TGAGCCATTT 420 

CCGCAATGTC TTTTATGGTC AGTTTCTTCT CCAATCTAAA ACACCTCCTA CTAATACTTA 480 

TTATAACATA TTCACTATTC AAAATGCTTA TTTCTAGCAC TGCAAGCGAA TCATTCTCTC 540 

ATTTATTTTA TCATATACTA GACATTTTCC ACGCATTTTT GTTCCGTCAA CGAGGCACAA 600 

TTTTTCACCT ACACCAATAT CCTCGCTTAA TAACAGAATC AGCTTGTCTG TTTCGTCATT 660 

TTTCACTTCA AAAGCCAGAG CATTTTCCAC TTCACGATCA TCTGTCTGGT AGACAGAATG 720 

ACGGATGATT TCAAAACTCT CATGCGCAAT GATGGTATAA TCCAGCATCT GATTCTCAAA 780 

GAATTGGCGC TTGGTGAGTT TGCTACTTCT TTCAAGCTCA TTGTATTTAG GAGAAATGAT 840 

GGTATCTTCC AAATCAAAAT CAACTTCACT CCATAGTCTC AACTGATTGA TTTTCCCATC 900 

TTGATAGGTC ACATCCTTGT CAAGGATAAA CTGAGTCAAC GCCTCATGCT GACCTTGACA 960 

CCTGATGTCA TCTACCAAGA GCCAGACATC CTCTACCAAC ATGAGGATTT TTCTCTTGTG 1020 

AAGATAAGGC AAATCAGGTT CTGCTGACCA ATAAGCCCCT TCAATATAAT GCATTCCCTC 1080 

CCTTTCTTTA TGGTGACAAA ACAGGGAGTG AGGATAGTAT TCATATTCCC AGGATCCCGT 1140 

GATTCTTTCC GGA^CTTTCC CATCTACAAT GCAGGTCGAA TGACTCCAAG CACTCTTTAA 1200 

GAGATAACGT TCATATATCT CCCGATAAGA ATAACGCCCA GCATCTATGA AAATAGGTTG 1260 

GCCTTGATAC TGTAAGCAAA AACTATTCTC GTCACTATGG CTATGGGCAC TTCCTAGCGG 1320 

ACCATTTTTG AAAAATAGAT AACGATGTTC ATCCTTAATG CAGACATGTC CAGAGTCTTC 1380 

AAAGATCATG GACTTAGGCT GCCAAGCTCT CTTTTCAAAT TCCTGCAGTC GCTTGATCTT 1440 

TTCTCGCCCC AGGAACAAGA GGCTAACCTA ATCGACTTTA ACATCCAAAC CGTTAAGAAG 1500 

GTCTCCCTTG TTCAAAACCA CAGCAGACAG GCTCAAGTTA TCTGTCGTGT CTGTTGAATC 1560 

GCTATCACCA ATTTCCCATA GTTCGTCC 1588 

(2) INFORMATION FOR SEQ ID NO: 59: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1551 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic UNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 



60 
120 
180 
240 



NTCTTGGGCN CNGGGCGNNT CCTTTGAGGA CNACGGTATC GATGACCTTG ATCTCAAGTG 
CAAGCAGTAT CTGAATCTGC AGCAGCACCT GTCCGTGCAA AAGTTCGTCC AACATACAGT 
ACAAACGCTT CAAGTTATCC AATTGGAGAA TGTACATGGG GAGTAAAAAC ATTGGCACCT 

TGGGCTGGAG ACTACTGGGG TAATGGAGCA CAGTGGGCTA CAAGTGCAGC AGCAGCAGGT „ u 

TTCCGTACAG GTTCAACACC TCAAGTTGGA GCAATTGCAT GTTGGAATGA TGGTGGATAT 300 

GGTCACCTAG CGGTTGTTAC AGCTGTTGAA TCAACAACAC GTATCCAAGT ATCAGAATCA 360 

AATTATGCAG GTAATCGTAC AATTGGAAAT CACCGTGGAT GGTTCAATCC AACAACAACT 420 

TCTGAAGGTT TTGTTACATA TATTTATGCA GATTAATTTA CAGAGGGACT CGAATAGAGC 480 

CCTCTTTTCA GGTTTTACCG TGACAATCCC TATTAAAAAT TATATCAAAA TCGTGAAAAT 540 

ATTGGAAAAG TATGGTAGAA TGAAAATTGT CGTGTGAACG ATAATACTCA TTCTTGATGA 600 

ATTGTGAAGC AGTTGCCCTT GGGTCGTTTT GCGAGTTGAA GTCAAGAAGA GGAAAAAAAC 660 

AAAAAGGAGA AATACTCATG GCAGTAATTT CAATGAAACA ACTTCTTGAG GCTGGTGTAC 720 

ACTTTGGTCA CCAAACTCGT CGCTGGAATC CTAAGATGGC TAAGTACATC TTTACTGAAC 780 

GTAACGGAAT CCACGTTATC GACTTGCAAC AAACTGTAAA ATACGCTGAC CAAGCATACN 840 

ACTTCATGCG TGATGCAGCA GCTAACGATG CAGTTGTATT GTTCGTTGGT ACTAAGAAAC 900 

AAGCAGCTGA TGCAGTTGCT GAAGAAGCAG TACGTTCAGG TCAATACTTC ATCAACCACC 960 

GTTGGTTGGG TGGAACTCTT ACAAACTGGG GAACAATCCA AAAACGTATC GCTCGTTTGA 1020 

AAGAAATTAA ACGTATGGAA GAAGATGGAA CTTTCGAAGT TCTTCCTAAG AAAGAAGTTG 1080 

CACTTCTTAA CAAACAACGT GCGCGTCTTG AAAAATTCTT GGGCGGTATC GAAGATATGC 1140 

CTCGTATCCC AGATGTGATG TACGTAGTTG ACCCACATAA AGAGCAAATC GCTGTTAAAG 1200 

AAGCTAAAAA ATTGGGAATC CCAGTTGTAG CGATGGTTGA CACCAATACT GATCCAGATG 1260 

ATATCGATGT AATCATCCCA GCTAACGATG ACGCTATCCG TGCTGTTAAA TTGATCACAG 1320 

CTAAATTGGC TGACGCTATT ATCGAAGGAC GTCAAGGTGA GGATGCAGTA GCAGTTGAAG 1380 

CAGAATTTGC AGCTCCAGAA ACTCAAGCAG ATTCAATTGA AGAAATCGTT GAAGTTGTAG 1440 

AAGGTGACAA CGCTTAATTT ATACAAATAG TAATTACCTA GGAGGGCGGG GCTTAGCCCG 1500 

GCTCTCCTAT TTTCAAAAAA TATAGGAGAA TTAAAATGGC AGAAATTACA G 1551 

(2) INFORMATION FOR SEQ ID NO: 60: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1007 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 



CTATATTTCT GACTATCCTA AAAACAATGG TGAAGCAACT AGTCCAGCAG CCGGGAATAA 60 

TACAGGTTCT AAATACCCTT ATACTATTGA TGTGACAGGC GAGGTTGGTG ATTTGAAACA 120 

AGGTTTTTCT GTCAACATTG AGGTTAAAAG CAAAACTAAG GCTATTCTTG TTCCTGTTAG 180 

CAGTCTAGTA ATGGATGATA GTAAAAATTA TGTCTGGATT GTGGATGAAC AACAAAAGGC 240 

TAAAAAAGTT GAGGTTTCAT TGGGAAATGC TGACGCAGAA AATCAAGAAA TCACTTCTGG 300 

TTTAACGAAC GGTGCTAAGG TCATCAGTAA TCCAACATCT TCCTTGGAAG AAGGAAAAGA 360 

GGTGAAGGCT GATGAAGCAA CTAATTAGTC TAAAAAATAT CTTCAGAAGT TACCGTAATG 420 

GTGACCAAGA ACTGCAGGTT CTCAAAAATA TCAATCTAGA AGTGAATGAG GGTGAATTTG 480 

TAGCCATCAT GGGACCATCT GGGTCTGGTA AGTCCACTCT GATGAATACG ATTGGCATGT 540 

TGGATACACC AACCAGTGGA GAATATTATC TTGAAGGTCA AGAAGTGGCT GGGCTTGGTG 600 

AAAAACAACT AGCTAAGGTC CGTAACCAAC AAATCGGTTT TGTCTTTCAG CAGTTCTTTC 660 

TTCTATCGAA GCTCAATGCT CTGCAAAATG TAAAATTGCC CTTGATTTAC GCAGGAGTTT 720 

CGTCTTCAAA ACGTCGCAAG TTGGCTGAGG AATATTTAGA CAAGGTTGAA TTGATAGAAC 780 

GTAGTCACCA TTTACCTTCA GAATTATCTG GTGGTCAAAA GCAACGTGTA GCCATTGCGC 840 

GCGCCTTGGT AAACAATCCT TCTATTATCC TAGCGGATGA ACCGACAGGA GCCTTGGATA 900 

CCAAAACAGG TAACCAAATT ATGCAATTAT TGGTTGATTT GAATAAAGAA AGGAAAAACC 960 

ATTATCATGG TAACGCATGA AGCCTGAGAT TGCTGCCTAT GCCAAAC 1007 

(2) INFORMATION FOR SEQ ID NO: 61: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1591 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61 : 



CTCGCCAGAG AACTCATCCA CATAGATATT GCCAAGTCCA GCTACCAAGG TCTGGTCTAG 60 

GAGATGGGAT TTGATAGGCT TTTTGGACTT GGCAAGGGCA GCTTGAAAGA CCTGTAAATC 120 

AAAGTCTTGT TCGCTTGGTT CAGGACCTAA TTTTTTAGAA ATAAAGTAGG CGTCTAAAAG 180 

GTCAGGCACC AAGAGTTCCA TGGTTCCAAA CTTGCGTACA TCCTCATAAA CAAGCGTGCC 240 

CCCATCTTCA AACCGGAAGA AAACATGGGC ATGCTTGCGT TCAGGCACTT GGTCTGGATA 300 

ATAAAAATAC TTGCCCTCCA TCCGCAAATG GGAAATCAAG ACCTTGTCTG TCAGGCAGAA 360 

AAGCAAATAT TTTCCACGAC GTCCCATTGA CTCGATAATC TGACTAGGCA ATTCCCTTTG 420 
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AAACTCTTCC AAATCCGTCT TAATCATCTT GGGGTAGCGA ATTTCTATAC TCGAAATCTT 
CTTTCCTATA ATCAATTTTT CTAAGCTACG ACAAACGGTT TCAACCTCAG GTAATTCAGG 
CATAAGTCCT CCTTCTGTAA AAACAAGAAG CAGGCATGAG CCTACCTCTA CTTAGTATTC 
TCTTTCATTA TAGCCCAAGT CAGCCAAATC TAGCTTTTTA TCGCGCCAGT TTTTCTTGAC 
CTTGACCCAG GTTTCTAGGA AGACCTTGTC TCCTAGCATG AGTTCGATAT CACGACGGGC 
CATGCTACCG ATTTTCTTAA GCATAGCGCC ACCTTTACCG ATGATAATCC CTTTTTGGCT 
ATCGCGCTCG ACCATGATGG TTGCACGGAT GTGAACCTTG TCTGTCTCTT CGTCTCGTTT 
CATAGAGTCA ACAACTACTG CTACAGAATG CGGAATCTCT TCACGAGTTA GGTGCAAGAC 
TTTCTCGCGA ACCATTTCTG AAACTAAGAA ACGTTCTGGA TGGTCTGTGA TTTGATCAGA 
CGGGAAATAT TGGAAACCTT CATCCAGATT TTCACTCAAA ATATCCACTA GACGAGACAC 
GTTATTTCCC CTGAAGGGCT GAGATTGGAA CAATTTCCTT AAAGTCCATT TGATTACGGA 
AGTCATCAAT CTGAGACAAG AGCTGGTCTG GATGGACCTT ATCGATTTTA TTCACCACCA 
AAATCACAGG AACCTTGGCA GCCTTGAGAC GCTCGATAAT CATATCGTCC CCCTTACCAC 
GCGATTCATC AGCAGGCACC ATGAAAAGAA CAGTGTCCAC TTCGCGAAGG GTACTGTAGG « ou 
CAGACTCAAC CATGAAATCT CCGAGAGCTG TTTTAGGTTT GTGAATCCCT GGTGTGTCGA 1320 
TAAAGACAAT TTGCTCCTTA TCAGTCGTGT AAATTCCCAT GATTTTATTG CGCGTTGTCT 1380 
GCGCCTTGTC ACTCATGATG GCAATCTTTT GCCCCATAAC GTGATTTAAA AAGGTTGACT 1440 
TCCCAACATT GGGACGTCCT AAAATGGCTA CAAAGCCTGA TTTAAAAGTC ATAATTTCCT 1500 
CTTACTGTGT AAAATAATAA ATCCCAGAAT CGTGGGCCAG AAAAATCAAA GCGCCTGTTA 1560 
AGGAAGCAAA AAGAGAAACC ACTAATACCG C 



480 
S40 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 



1591 

(2) INFORMATION FOR SEQ ID NO: 62: 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 1652 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62: 

GATCTGGAGT GGATGTTGAT TTGAAATTGC CACCAGAAGA AAGTTCTCCT GACTTGGTTA 60 

TCAACGGAAA GGCACCATTT GCAGTGTATT TCCAAGACTA CATGGCTAAG AAATTGGAAA 120 

AAGGAGCAGG AATCACTGCC GTTGCAGCTA TTGTTGAACA CAATACATCA GGAATCATCT 180 

CTCGTAAATC TGATAATGTA AGCAGTCCAA AAGACTTGGT TGGTAAGAAA TATGGGACAT 240 

GGAATGACCC AACTGAACTT GCTATGTTGA AAACCTTGGT AGAATCTCAA GGTGGAGACT 300 

TTGAGAAGGT TGAAAAAGTA CCAAATAACG ACTCAAACTC AATCACACCG ATTGCCAATG 360 

GCGTCTTTGA TACTGCTTGG ATTTACTACG GTTGGGATGG TATCCTTGCT AAATCTCAAG 420 

GTGTAGATGC TAACTTCATG TACTTGAAAG ACTATGTCAA GGAGTTTGAC TACTATTCAC 480 

CAGTTATCAT CGCAAACAAC GACTATCTGA AAGATAACAA AGAAGAAGCT CGCAAAGTCA 540 

TCCAAGCCAT CAAAAAAGGC TACCAATATG CCATGGAACA TCCAGAAGAA GCTGCAGATA 600 
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660 
720 
780 
840 
900 
960 



TTCTCATCAA GAATGCACCT GAACTCAAGG AAAAACGTGA CTTTGTCATC GAATCTCAAA 
AATACTTGTC AAAAGAATAC GCAAGCGACA AGGAAAAATG GGGTCAATTT GACGCAGCTC 
GCTGGAATGC TTTCTATAAA TGGGATAAAG AAAATGGTAT CCTTAAAGAA GACTTGACAG 
ACAAAGGCTT CACCAACGAA TTTGTGAAAT AATGACAGAA ATTAGACTAG AGCACGTCAG 
TTATGCCTAT GGTCAGGAGA GGATTTTAGA GGATATCAAC CTACAGGTGA CTTCAGGCGA 
AGTGGTTTCC ATCCTAGGCC CAAGTGGTGT TGGAAAGACC ACCCTCTTTA ATCTAATCGC 
TGGGATTTTA GAAGTTCAGT CAGGGAGAAT TGTCCTTGAT GGTGAAGAAA ATCCCAAGGG 1020 
GCACGTGAGT TATATGTTGC AAAAGGATCT GCTCTTGGAG CACAAGACGG TCCTTGGAAA 1080 
TATCATTCTG CCCCTCTTGA TTCAAAAGGT GGATAAGGCA GAAGCTATTT CCCGAGCGGA 1140 
TAAAATTCTT GCGACCTTCC AGCTGACAGC TGTAAGAGAC AAGTATCCTC ATGAACTTAG 1200 
CGGTGGGATG CGCCAGCGTG TAGCCTTACT CCGGACCTAC CTTTTTGGGC ACAAGAATCG 
AAATCCCTGA GATGAGGCCT TTAGCGCCTT GGATGAGATG ACAAAGATGG AACTCCACGC 
TTGGTATCTT GAGATTCACA AGCAGTTGCA GCTAACAACC CTGATCNTCA CGCATAGTAT iJOU 
TGAGGAGGCC CTCANTCTCA GCGACCGCAT CTATATCTTG ANAAATCNCC CTGGGCAGAT 1440 
TGTTTCAGAA ATTAAACTAG ACTGGTCTGA AGATGAGGAC NANGAAGTCC NAAAGATTGC 1500 
CNACAAACGT CAAATCTTGG CAGAATTAGG CTTAGATAAG TAGAAANATA GGGAGTTGGT 1560 
GAAGATTATC CCTTTTACCA GCGCCCTTTT TTCTTTTAAA AATGAAGANA ATTTCGGTAT 1620 
AATNNTCAAN TAAGGTCAAG GTTTAAANAN AG 

1652 

(2) INFORMATION FOR SEQ ID NO: 63: 



1260 
1320 
1380 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1256 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 

CTAGGGCAAC AAGAATATAT GCAACTTCTA ACATTTAGTT TTCCTCCTCT GTAATATAGT 
AAGGGGCCTT CTTJCGATTT TGATAAATAA CGATCATTAT ACCGAGACCG ATAAAGACAA 
CTGACAGCCA TTGGGACACT CGAAGGCCGA AGAACATGAG ACTATCTGTT CGCATACCTT 
CGATGACCAT ACGACCGAAA CCATACCAAA TCAAGTAAAA GGCCGTGATA TGACCTCGTC 
TGAGACTCTT CCATTTCCGT CTAAAAATCA GAATCAAGGC AAAGCCAAGC AGATTCCATA 
GAGACTCATA AAGGAAAGTC GGTTGACGGT AGCTCCCCTC AATATACATC TGGTCACGGA 
TAAAGCCAGG TAGATAATCC AGATTATCCA CTGTTGCACC ATAAGCTTCT TGGTTAAAGA 
AATTACCCCA ACGCCCCAAA CTTTGAGCAA TCATAACGCT AGGCGCCGCA ATATCTAGAA 
AATCCCAAGT ATTGATGAGT TTACGGTCAG CAAAGATATA GAGCACAAGA GCCCCAGTTA 
TCAAACCACC GTAAATGGCC AAACCACCAT TCCAAATGGC AAAAATCTCT CCTAAATTCT 
GACTATAGTA ATCAAATCGG AAAATAACAT AGTAGAGACG AGCTCCTAAA ATAGCCAAGG 
GAAAGGCTAC TAAGATAAAA TCTAAAATAT CGTCTGGTAT GATCTTCTTT CTAGGTGCTT 

131 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
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CTTTCATGGT CAAATAAACC GCAAGAATCA 
GAATGGCTAG GGGTCCTAGT TGAATAGCAA 
GCTATGAGAC TTGTCAGTCG TTCGTCGAAC 
GCACGATAAT TCATGGCAGC TGCCTCAATC 
GGAATACGAA TACGAGGAAT GGCTACGCCA 
AGACGATCAA AGGTCTTATG CGTATCGTAA 
GAAGAATCCT TGACAGCACT CGCACCGTAG 
CGAATTTCAA TCAAGTGTTT CAAAATTTCA 
TTGGCAAAGA TATCGACACG GTCATCGGCT 
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AGCCTGTCAC 


AATACATAAG 


GCATACCAAC 


780 


TT\3GATCAAG 


CATTTTGCAC 


CTCATTTCGA 


840 


AAAC GGGT AG 


CATCAAAGCC 


CATTTCCTTG 


900 


ACAACAGAGA 


TATTACGACC 


TGTTTTAACT 


960 


GAAACTTCAA 


GTTCCTCTGC 


ATTGTTTCCA 


1020 


TTTTCCAAAT 


AGACAGCAAG 


CTGAACCTGT 


1080 


AGACTCATAA 


CATCGATAAT 


ACCAACCCCA 


1140 


GCTGGTTCAC 


CCCAGAGAGT 


AATCTCATCC 


1200 


ACCAAACGGT 


GACCACGTTT 


GACAAG 


1256 



(2) INFORMATION FOR SEQ ID NO: 64: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1847 base pairs 

(B) TYPE: nucleic acid 

(C) STKANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 



AAGAGATGGT 


GCCTGCTAAC 


CTAAGCCAGT 


AATTTTCTTC 


TAACTTTGGA 


AAATTATGTA 


60 


ATACCAGAGA 


CGGACCCTTT 


GTTCGCCTTT 


ACCCAGCCCA 


ATGTAACAGC 


TGACGAATTT 


120 


TAACATCAAT 


GGTCAGCTAC 


AAAATGAGGT 


CGAGCATGCC 


AAGATGAGTA 


AGATTATTGA 


180 


CCGTTATCGT 


CCAGCTGGTG 


AGGGCTTTGT 


CCGTATCGAT 


ACTCAAAACA 


ATATGCCTAC 


240 


GGCAGCGGGC 


CTGTCCTCAA 


GTTCTAGTGG 


TTTGTCCGCC 


CTGGTCAAGG 


CTTGTAATGC 


300 


TTATTTCAAG 


CTTGGATTGG 


ATAGAAGTCA 


GTTGGCACAG 


GAAGCCAAAT 


TTGCCTCAGG 


360 


CTCTTCTTCT 


CGGAGTTTTT 


ATGGACCACT 


AGGAGCCTGG 


GATAAGGATA 


GTGGAGAAAT 


420 


TTACCCTGTA 


GAGACAGACT 


TGAAACTAGC 


TATGATTATG 


TTGGTGCTAG 


AGGACAAGAA 


480 


AAAACCAATC 


TCTAGCCGTG 


ACGGGATGAA 


ACTTTGTGTG 


GAAACCTCGA 


CGACTTTTGA 


540 


CGACTGGGTT 


CGTCAGTCTG 


AGAAGGACTA 


TCAGGATATG 


CTGATTTATC 


TCAAGGAAAA 


600 


TGATTTTGCC 


AAGATTGGAG 


AATTAACGGA 


GAAAAATGCT 


CTGGCTATGC 


ATGCTACGAC 


660 


AAAGACTGCT 


AGTCCAGCCT 


TTTCTTATCT 


GACGGATGCC 


TCTTATGAGG 


CTATGGCCTT 


720 


TGTTCGCCAG 


CTTCGTGAGA 


AAGGAGAGGC 


CTGCTACTTT 


ACCATGGATG 


CTGGTCCCAA 


780 


TGTTAAGGTC 


TTCTGTCAGG 


AGAAAGACTT 


GGAGCATTTG 


TCAGAAATTT 


TCGGTCAGCG 


840 


TTATCGCTTG 


ATTGTGTCAA 


AAACAAAGGA 


TTTGAGTCAA 


GATGATTGCT 


GTTAAAACTT 


900 


GCGGAAAACT 


CTATTGGGCA 


GGTGAATATG 


CTATTTTAGA 


GCCAGGGCAG 


TTAGCTTTGA 


960 


TAAAGGATAT 


TCCCATCTAT 


ATGAGGGCTG 


AGATTGCTTT 


TTCTGACAGC 


TACCGTATCT 


1020 


ATTCAGATAT 


GTTTGATTTC 


GCAGTGGACT 


TAAGGCCCAA 


TCCTGACTAC 


AGCTTGATTC 


1080 


AAGAAACGAT 


TGCTTTGATG 


GGAGACTTCC 


TCGCTGTTCG 


CGGTCAGAAT 


TTAAGACCTT 


1140 


TTTCCCTAAA 


AATCTGTGGC 


AAAATGGAAC 


GAGAAGGGAA 


AAAGTTTGGT 


CTAGGTTCTA 


1200 


GTGGCAGCGT 


CGTTGTCTTG 


GTTGTCAAGG 


CTTTACTGGC 


TCTCTATAAT 


CTTTCGGTTG 


1260 
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ATCAGAATCT CTTGTTCAAG CTGACTAGCG 
CCATGGGCGA CCTTGCCTGT ATTGTGGCAG 
GCCAGAAGGC GGCTGCTTGG TTAGAAGAAG 
GGGGATTTTT TATCTCACAA GTGAAACCAA 
CCAAGGAAGT GGCTGTATCA AGTCACATGG 
ATTTTTTAAG TTCCTCAAAA GAAACGGTGG 
AAGCCGAAAA AGTTATCGAG CAAGTAGAAG 
CAGATATTTA CACGCCTTTG CTTAGACAGT 
TTGCCAAGAG TAGTGGTGCT GGTGGTGGTG 
AATCAACCGA AACCTTAAAA AATCGTTGGG 

(2) INFORMATION FOR SEQ 
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CTGTCTTGCT 


CAAGCGAGGA 


GACAATGGTT 


1320 


nuunl I I viO I 


TCTC T AC CAG 


TCATTTGATC 


1380 


AAAAv-TTGGC 


GACAGTTCTG 


GAGCGTGATT 


1440 


f MINIUM * ^ IV * ^n^* 

LI 1 i AGAATG 


TGATTTCTTA 


GTGGGATGGA 


1500 


CCCACCAAAT 


CAAGCAAAAT 


ATCAATCAAA 


1560 


TTTCTTTGGT 


CGAAGCCTTG 


GAGCAGGGGA 


1620 


TAGCCAGCAA 


GCTTTTAGAA 


GGCTTGAGTA 


1660 


TGAAAGAAGC 


CAGTCAAGAT 


TTGCAGGCCG 


1740 


ACTGTGGCAT 


CGCCCTGAGT 


TTTGATGCGC 


1800 


CCGATCTGGG 


GATTGAG 




1847 



ID NO: 65: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1360 base pairs 
(8) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:65: 



CTTCCTCCAG CAAAATCCAC TGCTGAGAAG CTAAAGGGAG CGTGAGATAG CCCTCTTTCT 60 

CTACTGGTTG GTCTGAAATC CGAGCCTCAG GAAACCAGTC TTGTAGTTCT TTTTCCCTCA 120 

TGTTCTAGCC CTCCACTTTT TGGATGCACC ATGAAACCAA ACTCTCAAGA CGTTCCAGAT 180 

TCTCAGTCAT ATGGAGATAG CCCATAACCG CTTCAAATCC CGTGGACATA CGATAAGTCA 240 

CGACATCTGC ATTTTTAGCC TTTGTGTGGC TATTGGTATT GCGGCCACGT TTGTAGATTT 300 

CTTCTTCTTT TTCCGTTAGG ACCTGCTCCT CCAACATGAG AGCAATCAGG CGAGCCTGAG 360 

CCTTGGCTGA CACATACTTG GTTGCTTCTT GATGGAGTTT ATTGGGTTTG GTCATACCTT 420 

TGAGGATGAG GTGACGGCGA ATATACATAG AATACACCGC ATCCCCCTCA AAGGCTAGCG 480 

CAATCCCGTT AATGAGATTG ACATCAATCA CGTGTCCACC TCACTCCATC CTTGGTATCA 540 

AGGAGCTTAA TTCCTTGAGT AACCAATTGG TCACGGATTT GGTCTGCTGT CTCAAAGTCT 600 

CGATTGGCAC GCGCCTCTTG GCGTTTTTGA ATCAAGTCTT CAATCTCTGC ATCCAAAACT 660 

TCCTCAACAA AGACAATTCC AAAAATTTCT AACATATCTG CAAGAGCTTG CTTGACACTT 720 

GCATCATAGT TCCCTGAGTT GATCCATTTG GCCATTTCAA AGACAACTGT GATACCGTTG 780 

GCAGCATTAA AATCTTCATC CATAGCTGCT ACAAACTTAT CTTTAAAGTT TTGTAACTCT 840 

TGGGCATCCA CGTTTCCTGT AAATGGTTGT TCGTAAGTAT TCTTCAGATA CTTGAGATTG 900 

GTCTCGGCAT CGCGAACTGC CTTTTCCGTG AAGTTGATAG GCTTACGGTA GTGCTGGGTC 960 

GCAAAGAAGA AACGAAGTAC TTGCCCATCA AGAGTTTTAA GGGCATCGTG TACCGTAATG 1020 

AAGTTACCCA AGGACTTAGA CATTTTGACA TTGTCGATAT TGACAAAGCC ATTGTGCATC 1080 

CCAGTTAGTT AGCAAAAGCC TTGCCTGTTT TAGCTTCAGA TTGGGCAATT TCATTGGTGT 1140 

GGTGTGGAAA CTCTAGGTCA GCTCCACCAC CGTGGATATC AATGGTATCA CCTAAAATCT 1200 
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CTGTCGACAT GACTGAACAC TCAATATGCC AACCCGGACG TCCAGGTCCC CAAGGACTAT 1260 

CCCAAGAAAT CTCACCTGGT TTGGAAGATT TCCATAAAGC AAAGTCTACA GGATTTTCCT 1320 

TACGAGCCGT TTCTTCATCG CTACGACCTG AAGCACCTAG 1360 

(2) INFORMATION FOR SEQ ID NO: 66: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1297 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

lii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 

CTATGAGGTG GCCCTGGTTT TACTGGATAT CCAGATGCCC AAGCTTAACG GCTTAGAAGT 60 

CCTAGCTGAG ATTCGTAAAA CCAGTCAGGT TCCTGTCTTG ATGTTGACAG CTTTTCAGGA 120 

TGAGGAATAC AAGATGAGTG CCTTTGCCTC TTTGGCAGAT GGCTATCTGG AAAAACCTTT 180 

CTCCCTCTCC CTCTTAAAAG TGAGGGTGGA CGCGATTTTC AAGCGCTACT ACGATACAGG 240 

ACGAATCTTT TCTTACAAGG ATACCAAGGT GGACTTTGAA AGCTACAGTG CAAGCCTCGC 300 

AGGTCAAGAA GTGCCTATCA ATGCCAAAGA GTTGGAAATT CTGGACTATC TAGTGAAAAA 360 

TGAAGGCCGG GCCTTGACTC GGTCTCAGAT TATCGATGCC GTCTGGAAAG CGACAGATGA 420 

GGTTCCCTTT GACCGTGTTA TTGATGTTTA TATCAAGGAA TTGCGGAAAA AGCTAGACTT 480 

GGATTGTATC CTCACTGTGC GCAATGTTGG TTATAAATTG GAGCGAAAAT GAAACGAACA 540 

GGTTTATTTA CAAAGATATT TATCTATACC TTCTCGATAT TTAGTGTTCT GGTTATCTGC 600 

CTTCATTTAG CTATTTATTT TCTTTTTCCT TCGACTTATC TGAGTCATCG TCAGGAAACC 660 

ATTGGTCAAA AGGCAACAGC CATTGCCCAG TCCCTAGAAG GGAAAGATAG GCAGAGTATC 720 

GAGCAAGTGT TAGACTTGTA TTCCCAGACT AGTGATATCA AGGGGACCGT CAAAGGTGAG 780 

ATGACCGAGG ACAAGTTAGA AGTCAAGGAC AGTCTTCCTC TGGACACAGA CCGCCAGACA 840 

ACCTCTCTCT TTATTGAGGA GCGCGAGGTG AAAACGCAAG ACGGTGGTAC TATGATTCTC 900 

CAGTTTCTAG CTTCCATGGA TTTACAAAAG GAAGCGGAGC AAATCAGTCT CCAATTTCTT 960 

CCCTATACCT TGCTGGCCTC CTTTCTGATT TCCCTCTTGG TGGCCTACAT CTACGCTCGG 1020 

ACTATTGTTG CACCGATTTT GGAAATCAAG CGGGTGACCC GTCGGATGAT GGACCTGGAT 1080 

TCCCAAGTGC GATTGCGCGT GGATTCTAAG GATGAGATAG GCAATCTCAA GGAACAAATC 1140 

AATAGCCTCT ACCAGCATCT CTTGACTGTT ATTGCGGACT TGCATGAAAA GAATGAAGCC 1200 

ATTCTCCAGC TGGAGAAGAT GAAGGTCGAA TTCCTACAAG GAGCTTCTCA TGAATTGAAA 1260 

CACACCGCTG GCTAGTTTGA AAATCCCAAT CGAAAAT 12 91 

(2) INFORMATION FOR SEQ ID NO: 67: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1043 base pairs 
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(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67: 



CTTGTTAGAG TCCTACTCGC GAGAGCAAGT CAACTTCTTT GTGGATCAAG GTGCTGTTAC 60 

CATAGTCCAA AAGGAAGTTC GACGCTCGGC TGCTTATTTT GAAGGAATTG AAGCAAGTAG 120 

ACCTTTGGAG TTAAATCCAG AACAAAGACA GGCGCGTGAT GCGGTTGTGA GTTCTATTGG 180 

CAGTTCTCAA CCTCCCTTTC TACTTCAAGG GATTACAGGA AGTGGGAAGA CCGAGGTTTA 240 

CTTGCAGATT ATCCAAGGTG CCCTGGATAA GGGCAAGACA GCTATTTTGC TGGTACCTGA 300 

GATTTCCTTG ACTCCTCAGA TGACCGAGCG TTTTATAGCA CGTTTTGGGG ACAAGGTAGC 360 

CATTCTTCAC TCAGGATTAT CCAATGGTGA AAAGTATGAT GAATGGCGCA AGGTGGAGCG 420 

TGGCGATGCC CAAGTTGTTG TTGGTGCCAG ATCTGCTATC TTTGCTCCTC TGAAAAATCT 480 

GGGTGTCATG ATTATTGATG AAGAGCATGA AGCGGCTTAT AAGCAGGACA GCAATCCCCG 540 

TTACCATGCC AGAGAGGTAG CTATTTTACG GGCTCGGTAT AATCAAGCAA CTCTGGTACT 600 

TGGATCTGCA ACGCCGAGCT TAGAGAGCCG GGCACGTGCT GGCAAAGGCG TCTATCAACA 660 

CTTACGTCTA ACCCAACGTG CCAATCCTTT GGCTACAATC CCTGAGGTTT CAAGTGATTG 720 

ACTTTCGAGA CTATATCGGA CAAAATGAGA CGTCAAACTT TACGCCTCCT TTGCTAGAGG 780 

CTATCCAAGA CCGTCTGGTT AAAAAAGAGC AGGTGGTTCT CATGCTCAAT CGCCGTGGTT 840 

ATTCTAGCTT TGTCATGTGT CGGGAGTGTG GGACGGTGGA TACTTGTCCC AACTGCGATA 900 

TTTCTCTGAC CTTGCATATG GATACCAAGA CTATGAACTG CCATTATTGT GGTTTCTCGA 960 

AGGATATTCC TCAGGTCTGT CCTAACTGTA AGAGCCGCAG TATTCGTTAC TATGGGACGG 1020 

GAACTCAGAA GGCTTATGAT GAG 1043 



(2) INFORMATION FOR SEQ ID NO: 68: 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 1449 base pairs 
<B) TYPE: nucleic acid 

(C) . STRANDEDNESS : sing-le 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68: 



CTAGTGCGTC CTAGTTTACG GTAAGCCATG TATTCCTCCT TTATTTATCT TTTAATCCAA 60 

GACCCAAATC AATGAGTTTG AGTTTCACTT CTTCCAAACT CTTGCGTCCA AGATTTCGTA 120 

CTTTCATCAT CTCTGCTTCA GATTTTTCTG TCAAATCATG CACAGTATTG ATACCGGCAC 180 

GTTTTAAACA GTTGTATGAA CGCACAGACA AGTCCAGTTC CTCAATCGTA CGATCTAAAA 240 
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TACGGTCGTC AGATTCAGTA TCAGCTTCTT 
TAAGATTTGT AAACAAATCA AGATGTTCTG 
CTTCTGGAAT AATTGTTCCA TTTGTCAAGA 
TGCTACCTAC ACGAGCAGGT TCCACTTGAT 
AATCTACAGC AAGTGTTCCA ACTGGTGCAT 
ATCCACGACC ACTGTTAACA GTCATAGTCG 
AGAGATAATG ATCTGGATTT ACAATTTCAA 
TTACTTCAGC AGGACCTTCA ACATCCAGTT 
TCACTGCAAT TCCTTTAATG TTCAGAATGA 
CTGTGTCAAA CTCATGTAAC ACACCATCAA 
GAGAAGCTAG AAGTACACGA CGAAGAGAGT 
GTGGTTCGAT TACAAACTTG CCATAATCTT 
TTTCAAACTC GATCATTTAG TTACTCCCTC 
ATACACGGCG ACGTTTTGGA GGACGAGCAC 
TTGCTGTTAC TTCAAGACCA GCGGCAGCAA 
CTGGACCTTT TACAGTAACT TCAACTGATT 
CAGCTTCAGA AGCCATTTGA GCAGCGAATG 
TGAGCACCAG CTGATGACCA AGCAATTGCA 
GTGTTATTAA ATGTAGCGTG AATATGAGCA 
CGTTTACGTG TTGGTTTAGC CAAGACTTTT 
TCGCAACAG 

o 

(2) INFORMATION FOR SEQ 
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TCATCACTTC 


AGTTGACTTA GCAATCTCAG 


300 


TCAAAATACG 


TGCTGAAAGC CCTAAAGCAT 


360 


TTTCAAGGGT 


TAATTTGTCG AAACCATCAT 


420 


AGTTGACTTT 


TGTAACTGGT GTATAAATAG 


480 


TATCCTTTTT 


ATTTTCATCA GCAGGTACAT 


540 


CTTTTAGAGA 


AGAACCTTCA CCAATTGTAA 


600 


TATCGCTATC 


TGTCAAAATG TCACCAGCTG 


660 


CGATGATTTT 


TTCGTCTTCA ACGTACGATT 


720 


TTTGCATCAC GTCTTCACGA ACACCTGGAA 


780 


TGTTGATAGA 


TGTCACAGCT GCTCCTGGTA 


840 


TACCAAGAGT 


TGTACCGTAG CCACGTTCAA 


900 


TATTTTCATC 


AATTTTTGTT ATATTTGGTT 


960 


TTAAACGAAA 


AGCAGTGTAA TGCGATGATT 


1020 


CATTGTGTGG 


CACTGGAGTC ACATCACGAA 


1080 


GCGCACGAAT 


AGCTGACTCA CGACCAGAAC 


1140 


TAAGACCGTG 


TTCTTGTGCA GATTTAGCAG 


1200 


GTGTACATTT 


ACGAGAACCT TTGAAACCAA 


1260 


TTACCATGCA 


CATCAGTAAT CATAACAATA 


1320 


ATACCAGATT 


CGATATTCTT TTTCACACGA 


1380 


ACCTCCTATA 


TTATTTTTTC TTACCAGCAA 


1440 






1449 



ID NO: 69: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1167 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



<xi) SEQUENCE.. DESCRIPTION: SEQ ID NO:69: 



CTTGTGTATG ATTTTCGATA TCCAAAATTT GTTGTTCAGA ACGGCCGTCA AATTTTACAG 60 

TTGAAATTGA ATCTTTCAAA CCGATACGTT CAATCAAACC TTTGGCTAAC ACTTTTTCCT 120 

CTGGCATTAA GTATAATTGC CATTTCAAAC TTGAACTTCC TGCATTGATT GCAATTGTTT 180 

TTGTCATAAC ATGATACCCC TTTTTAAGCG TTTTCAACTA TTATAACAAA ATCTATTTTA 240 

TATTTCAGTA CCTTGAGTCC ATTTTTGAAA ATTTTCTTTA AATTTCATTA AAACACTTGC 300 

ATCTTGCAAG CTAGCAAGTG GATAAACAAA AGGCTCCACT GCTATTTCAC TTTTCTTCTG 360 

TAGGATAAAA ATAGTCTTAG ATTGTTTAGC ATTAGCAAAG AGATTTTCAG GCAGACTAAT 420 

CATAGCAACC AGACTCGCCT CTTCTTTCAG CCATTCTTTT AACAAATCAC TTTGAGGACT 480 

GGTCAACAAA TCACTCGGAG CTAGAAAAAT AGGCGTATCC GTCTGACTTG AGGTACTTAA 540 
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GCCCTTGTTC 


CATGAGCAAG 


TGATGGGCGT 


AAGTATGTTb 


TTGGCTAGAA 


GCAACTGGAT 


600 


GGCGCGACGC 


AACGGCATCA 


TCAGGATAAT AGCCAACAGG 


CAAGTCACTG 


ATGACCACAT 


660 


CGCTTTCTTT 


GAGCATTTGT 


GGACGAACGG 


CATCTCCTTG 


GACAAAGCCA GCCTGCAAAC 


720 


CAATTACATC 


TGCCATGCTA 


GCTGCCAGAT 


CAATCAGCAA 


ATCATCCACT 


TCCATTCCCA 


780 


AATAATCCAC CTTTTTAGTA AGCGAGGTCA AGAAAGTAGC 


GCCCAAAATT 


CCCATCCCAG 


A An 

OHXJ 


AACCCATTTC GAGGATAGTA ATTTCCTCCT 


CTTTAAACAG 


CTCTTCCACA 


ATAAACACCA 


900 


AAAGTAAAGC 


AATAGCATCC 


GGTGTAAACT 


GGTGATTGGC 


CTGCAAGGGT 


TCTGTTTGCC 


960 


CAGCCTTCAT 


CAAGAGAAAC 


TGGTAGGTCT 


TGAGCCATTC 


TTCTTTGCGT AGTGCTAAAC 


1020 


GCTTAAGGGT 


CTGATTGTTG 


TCTTTGACCT 


GTTCTAGCTC 


AGTCTCACCA 


TCCAGATAGA 


1080 


TGCTGTTTTG 


CTCCACCAAG 


GCGTCATAAA 


AGTTGGNCGC 


CAAATCACTT 


TGGATGACTT 


1140 


GGACATNCNC 


TAGTAAATAG 


GTATAAG 








1167 



(2) INFORMATION FOR SEQ ID NO: 70: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1644 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 70: 



TTTTTACCAC TTCACCGGAG TTTTTCTTCC TTAACTTCCA TCAGGATTAA TCGCTGTAAA 60 

GATACGTTTC TTTAACCAGT TTTTCCTTCT TGTTCNACAC GAGTTTCACC TAGAAACAGT 120 

GTTGAATCTT TTTTCTCAAC TGTCTTGAAG GCCAAATCTT TTTCAACAAA ATTTCGAGTT 180 

GTGGGGAAGA TCTTTCTTGT AACAGCAGCA ACTGTCTTTC TCCAGAAACT GGTTTTTCCC 240 

TTAGTCAACT GGATACCGGT ATTCCTTAAC TTGTTTTCCA CTTTCTGAAA CGAGGCGAAC 300 

AAGTACTGGA AGGCAATCTT CTCCACTATC TACCACAGTT GAAGCTACTT GATTGTTTTC 360 

TTCAACTGAG ACTTTTGGCC GTTGACCTTT ATAGGTAATT TGATAGTCTT GACGATTTTC 420 

AGCGAAATCA GCAAGTTCTT TTCCATCTAC AAGAATCTTC GATTGCGTGC TTTCTTGAGG 480 

CAATTCACTT GGTGCAAGGA AGGTCATCTC AATCATCGCA ACACCGCTCT TATCTGCTTT 540 

ACGCTCCATA CGCCATCTCA TAGCTTTGGC TTTGACAGCT TTAAATGTTA CGTTGATTTC 600 

ATCACCAGCT GCGATGTCTT TATCCGCACG ATAAGGCACA GCTTCCCAAT TTTCTGGATT 660 

GTTGAATGGA TGGTCTGCGT CGTAGGCTTG GTAGTTTGAA TAGTAGGTTC GCACTTCAAA 720 

CTCTGGACCG ACATAGCGTT CTAAAACGAG TTTAGTTGGT GCATCCGTAC CACTATCTGC 780 

AAAGAAGTGA AGTTTGGCTT GCGCAACAGT CCGTTCTACA ATCTTACCAT TTTCACGGAA 840 

GATCACACCC GCTGATACTT CTGGATTAGA AGATGGTGTT GGAGACCAGT TTGTCCAACG 900 

ACGATTTTCT GAATGATCTC CGTCATTGAG ATAGTCAACG CGGTCATGAG AGTTTTTGTC 960 

AATATCATTG GTTGCTGAAG CAAAGGCCTG GTTACTGTTT TCATCATAGT TAGGGTTATC 1020 

TGAAAGAGCT TCGCCTAGTT TGTCTGTCAC TCGTACAGTG ACCTCAGCAA CAAGATCACT 1080 

ACCAAGGACA TGGCCTCGAA CGGTAAATTG ACCTGCTTTT GTCAGATTTT CTGCTGGAAC 1140 
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TTCTTCCCAT TCAACTGACA AATCTTTTGT 
TGGAACCTTA GTCGGCAATT CAAGTGCTTG 
CGCAGCAACT GGTTTATGAG AAAGTAAGTT 
TCCTAAGATG TCGCCATTTT CAGCTTTCGC 
GCTTGGAACG ACGGTAGCGA GACCATTGTT 
TCCCATCTAC AGACTCAAGG TAGTATCTGT 
CCGTCAACTT GGATTCTTGT TGTCCTTGCT 
GTACCTCTGT GATAACGTTC CTAATTTGTT 
CATAGGTTTC AACTTTATCA AGAG 

(2) INFORMATION FOR SEQ 
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TTCGTAGCCG TCTTTACCTG TGAAGTAAAC 1200 

ACCTACTTGT AGCAAGCGAG CTTGTTTAAC 1260 

CTTATCCTTA GTGAAGTGCA GACGGTATTC 1320 

GATGACACGA ACTGGCTCAC CTTCACGAAC 1380 

GCTAACACTT GGCTGTGACT GCCGGAACTT 1440 

CAGATCAGGT TGAAGTTTGC TAAGTCTTTA 1500 

TGGCTGCCGC AACTTGTTTC GCAAAGATTT 1560 

GTCTGCTCTC ACCATGGCGA ATACGAACAG 1620 

1644 

ID NO: 71: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1448 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 



<ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71: 



CTCCCAGCAT GTACTCACCT GTTTGGTAAT TCTTATAAAG CTGATGACGG CTCCTTCTTG 60 

CACCAATTGC TCAATCCCCT TGTGGAAGGA TTTTTGCTTC ATAACATTCT TAGCAGAAAC 120 

TTTCATGAAA ATTTCAGGAG TAAAGGTTGG CAGTGGTTCA AATTCAAACT TGTTTTTTCC 180 

AACCGTCAAG GTATCCCCAA CCTGATAAGT ACCGGTATCG TAAACCCCGA TAATATCACC 240 

TGCTACGGCA TTGGTCACAT TCTCACGACT CTCCGCCATA AACTGGGTAA CATTAGATAG 300 

TTTGGCACCC TTACCAGTAC GAGGGAGATT GACACTCATG CCACGCTCAA ATTCGCCTGA 360 

TACGATACGG ACAAAGGCAA TACGGTCACG GTGACGAGGA TCCATGTTGG CTTGGATTTT 420 

AAAGACAAAG CCTGAGAAAT CCTTGTCATA AGGATCCACA ATTTCACCGT CTGTTTTCTT 480 

GTGACCATGT GGTTCTGGAG CAAACTTGAG GAAGATTTCA AGGAAGGTCT GCACACCAAA 540 

GTTTGTCAGG GCTGAACCGA AAAAGACAGG CGTCAATTCT CCAGCCAGAA TAGCTTCCTC 600 

TGAAAACTCA TTCpCAGCTT CATTTAAAAG CTCAATGTCA TCCTTGACTT GCTCGTAGAA 660 

AGGATTGCTA CCAAAAAGTT TGTCTCCATC TTCTAGGGCT AGCAAAACGC TCATCCCCTT 720 

TGTAAAGCTC TAAACGTTGG TTATAGAGGT CATACAAGCC CTCAAAGGCT TTCCCCATCC 780 

CGATAGGCCA GTTCATAGGG TAGCTAGCAA TGCCCAAGAT TTCTTCCAAT TCTTGCAAGA 840 

GATCCAAAGG CTCACGACCG TCACGATCCA ACTTGTTCAT AAAGGTAAAG ACTGGAATGC 900 

CACGATGTTT CACAACCTCA AACAATTTCT TTGTTTGAGC CTCGATCCCC TTGGCAGAGT 960 

CCACGACCAT GACCGCAGCA TCCACCGCCA TCAAGGTACG ATAGGTATCT TCTGAGAAGT 1020 

CCTCGTGCCC TGGCGTGTCT AAGATATTCA CGCGCTTCCC GTCGTAGTCA AATTGCATAA 1080 

CAGATGAAGT AACAGAAATC CCACGTTGCT TCTCGATATC CATCCAGTCA GATTTAGCAA 1140 

AAGTCCCTGT TTTCTTCCCT TTTACCGTAC CAGCCTCACG AATCTCACCC CCAAAGTAGA 1200 

GTAACTGCTC AGTGATGGTT GTTTTCCCCG CGTCCGGGTG GGAGATAATG GCAAAGGTAC 1260 
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GACGTTTCTT AATTTCTTCT TGAATATTCA TAAGTTCTCT TTCTTTGATT CTCTATTTTC 1320 
CTTGTTTCAA TAGCTGAGAA TGATTTGTAC ATTGGATTTT ACCATCCTTT CGACACTCCA 1380 
TTATATCGGA TGTTAGCATT TTTTCCAATT TCTATTTCTT TCCACTTCCC CCTCCCTTAT 1440 



TTATAGGA 



1448 



(2) INFORMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1971 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 



CTCCTAGAGC 
CGCCGCGTTT 
CTGTAATAAT 
CAATCAAACC 
CTGCAAAGTC 
CGTTTGTCAA 
AGAAGGTAAA 
GAGTTCCTGA 
TGGCCCCGAA 
GAGAGGAACT 
GCTACTTTAT 
CCACCAGTCA 
CCACAGATGA 
AGAACCATAA 
TATCTCAGCC 
ATTAGAACAA 
CTATCCTGAT 
AATCCTTGGT 
TGGACTTTAT 
TAATTTGGAG 
GGGAACTATT 
CTGGCAGGCC 
GGATGCCCTA 
TATCGCCCGA 
ACTGGATACC 
GAGCTTAATT 



AACTTGAAGG 
ATCAGCGTAG 
AAGAATCGGA 
AAAGGTAAAT 
AACCGCTGGG 
CTCAGATACG 
GGCTGTTTGG 
AGTGATGACT 
GAATACGAGC 
TAGGTAATTT 
CTACAAACCA 
AGCGTTGAGT 
TACCAATCGC 
GAGATACTGT 
AAGAGAATCG 
AAGCAAGTTA 
GCGGCCCAGC 
ATCATTGGGG 
CCAGTAGACA 
CAGTGGCGGT 
CGTTCCAACT 
TTGGAGATTG 
ATTGAAGCAG 
GCAGTCTTGC 
ATTACAGAGT 
TTGATCTCTC 



ACACCTGATA TAAAGGAAAG 
ACCGCAATGG CTGCATTGTC 
TTTTTAAAGA CGATGAGCAA 
CCTGACAAGA CAGCATTTGG 
AATGAACCTG GCAACAATTT 
AACATTCGGA CACCTTGCAT 
ATAATGTACA TAAAGAAATC 
TCTTTATTAG ACATAATGTC 
ATCAAAGGTA GCAGATGCAA 
AAGATTGTCT AAACTTTCTT 
GATTGCAAAT TGCTGTTGGT 
TGCCTCAACA GTCATATTTG 
TGCTGTACCG TAAGCATTGC 
TGCAGCTAGC CATGCTGATC 
AGGAGGTCTT TGCGGAATCT 
CCAGTGGTCG GGTTTTACAA 
CTTCTCTGAG AGACATTTCC 
GGACTGGTTC TGGTAAATCA 
AGGGGAACAT TGACCTTTAT 
CTTGGATTGC CTATGTACCT 
TGACTTTAGG TTTAAATCAA 
CGCAAGCTAA GGATTTTGTC 
GGGGGCGAAA TTTCTCAGGT 
GCCAAGCTCC GTTTATCATC 
CCAAGCTCTT GAAAGCTATT 
AACGAACCTC AACTTTACAG 

139 



GATAACAGCC 
AAAGAACACT 
GACAATTGTA 
AGAACCAAAT 
GTTTGAAATA 
CAAAACGAAC 
TTGTTTAGCA 
TGGACCCAAG 
CAACTGTATC 
CTTTCTTACC 
GACCAATCGC 
AACTAACTGC 
GCAATTGTGG 
AATTCCCTCA 
CCCGAAAACA 
GTCCAAGAAT 
TTTGATATGA 
AGCTTGGTGC 
CAAAATGGAC 
CAAAAGGTCA 
GAAGTATCTG 
AGTGAAAAGG 
GGACAAAAAC 
CTAGATGATG 
AGAGAAAATT 
ATGGCGGACC 



GCTTTCCATC 

GGTACAAATC 

ATCAATTGAC 

CCATAAGAAG 

CCTTGGAAGG 

AAGTAAACTG 

GGATTGAATA 

ATTAAAAGAA 

GTGGAAGATT 

AAAGCGTCCT 

AAATCCGCCA 

CCAGTAAAGT 

TACTAAGAAT 

ACCAGTCCTA 

TCCATTCAGA 

TGACTTTTAC 

CTCAAGGACA 

AACTCTTACT 

GTAGTCCTCT 

AACTCTTTAA 

ACCAGAAACT 

AAGGACTTTT 

AAAGGTTGTC 

CAACCTCGGC 

TTCCAAACAC 

AGATTCTCCT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
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CTTGGAAAAA GGTGAGTTGC TAGCTGTTGG 
AGTCTATCGT GAAATCAATG CATCCCAACA 
CTGTAAACCA GACGCTCAAA CGTTTAGCCG 
TTCCTAGCCT TTCTAGGAAC TATTGCCCAA 
ATTGGGCAGG TCATTGACCA AGTCCTAGTG 
TTTCTCCAGA TGCTCTTGGT GGTAATAGGA 
CTCTATAATC GTCTAATCTT CTCTTATACC 
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CAAGCACGAT GACTTGATGA AATCCAGCCA 1620 

TGGAAAGGAG GACTAGAATG AAACGACAAA 1680 

TTAGATTTAG CAAACCATCC CTTCCTCCTT 1740 

GTTGGCTTAT CAATTTACCT ACCTATTCTG 1800 

GCTGGTTCAT CACCAGTTTT TTGGCAGATT I860 

AATACTCTGG TACAATGGGC CAATCCTCTC 1920 

AGAGATTTGC GAGAGCGGCT C 1971 



(2) INFORMATION FOR SEQ ID NO: 73: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 620 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73: 

TNTTTTGCCC NTTGCGCGTC CTTTTAGGAA AACGGTATCG AGGGCCNTGA TCTTATCTTT 60 

TCAATNTATG TANAACCCAG TGAGTTGAAA AAAATTGCCA GTGAATTAGA CACGACGGAA 120 

AACAACCTTG AAAAATTCAA CGAATGACTA TTGGTGGAAA AAATTGACCG TCTACTATAT 180 

GATTGACGAT GCAGGGGTTG GTGTGAAAAT TTTGACATCA AGTAAAGATG CCAATGACTT 240 

GTTAGAGAAA AAAATTGATG GGTTGATTGA GAAATTTAAA CATGCTTATG CAAATGTGAA 300 

AATTGAAAAA CTGGAAACCA TCAACTCTAA AGAAATTGAA CGCAAATGGT GAAACAGCGC 360 

TTGGCTTAAA TGTGAAAAAT CTGATNCTNC GTTTAGTTGT TATTGGAGTG GTTTGTGTGA 420 

TTTGGGTNGT GATGGGAAAT GTGTTAGTTT ATCTCTTTAA TCCACCAATC ANTAAANTAG 480 

GTGATTTTCC CCNCCTACCN NNTNAATTTT GTTNCCAAAA NTTAANNCCA TTGCCNCCCT 540 

NCCCAAATTT TTTGTNCTNC CCCNATTTCC NNACCGGAAT TTGTTCCCNN NNTATTTAAC 600 

CCCCCCCCCC CCCCCCTTTT 62Q 

(21 ^FORMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1550 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 
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CTGGTCCTNG 
CACTCGTGCC 
AGAAATGACA 
GTCAAGGACG 
CTACTTCCTT 
CCAAGGCAAT 
CCTTGTGCCC 
CGCTCCAGAG 
TGAAGGGCTG 
CTCAAACGAT 
TCACCCAGGG 
GCACGACGTT 
ACAACGACAT 
AAGTAAATAG 
TTTACTCCGA 
AGTTTTAAAT 
GAAGACAAAC 
ATCACGTGCC 
TGCTCTTTTT 
TTAGCTGAGT 
AAGGAAATAG 
ATTAATGCGG 
GTCTTTGCGA 
CGTTAAGCCA 
GTCAATCCTT 
ACTAACCTAA 



AGGTCAATCA 
AAAACACGAT 
TGTGTTACAA 
CGAGTGCCAC 
GTAGAGTTTG 
GGCACAGCCA 
AAGCTGTGTG 
AAACGTGTGA 
CTTGAGCTTT 
TTTCATCTTT 
TCTGAATATC 
CATTGTATTT 
TGTCCTTCAA 
CACGTTGGTC 
TAAAGTTATT 
CAAAGAGATG 
CACCACCATT 
CAGCTTCTTG 
CAAACTTGAG 
AGAAACGCAA 
CACCAGGTAA 
CCTGGTGAAA 
AGGGCTTCAG 
TCTTAACGAG 
TTTACTGAAA 
TCAAGGAGGA 



ATTCCCCCAG 
TGGTCAGGTA 
ATTNATCTAC 
CTGAAACTTC 
CCTACAGCTT 
CCAAAACCAG 
AACAAGAGTA 
CGCATTCATC 
AAGGGTACGT 
AATCAGATAG 
CAAGGCAACT 
AGAGTCCGCC 
ATCAAGTGGC 
AGCCCCCATA 
TTCTGTTTGT 
CTCTGCCACA 
TGGGATATTC 
CAAGAAATGA 
GTCAGCGAGA 
GACTTGGTCG 
AAACGTGCCC 
AGAAGGTTTG 
TAGTAAGATG 
TAATCGTTTT 
TTTTAGTAAA 
TTTATGGCTA 



ATAGGNCTCA 
GATTCGATCC 
TAAGGTCACT 
AGCGATATAG 
CCTTAAAGGC 
CCCCTGTCAT 
TCCAATTCCA 
AAACGACCAA 
TGGTTTTCAA 
CTATATTGGT 
TGCAATTCTT 
AATTCACGGC 
ACCAAGTCGT 
CCAATAGCAA 
TTTCCGATTT 
ACTCCTGTCA 
CCATAAACAT 
AGGACACCTT 
GGCACTTCGA 
TCACGCTTAC 
ACCGTTGTAG 
GTCTGCTTCT 
TTGTGCCATA 
CTTTCTATTG 
ATATCAGTAA 
CCTTAAAAGA 



ACTCCCAAAA 
ATCTCCTCAA 
ATAGAGCCTC 
AAGGCTNGGA 
CTCAACAGTA 
ACGAGCACCG 
AACCAGTTAC 
ATGTTTCCAA 
GCACAGCATG 
CAACGGCCCA 
CCACTGCTTT 
GTTTGTTGGT 
ATTCTAAAGT 
ATTGGTCCAT 
TAACCAAATC 
AGAGCTCCAA 
AAAAATCAAA 
TTGGATAATT 
TAATGCCCTT 
GAGCTGCACC 
TCTGTATGTT 
TGACCAAAAA 
TGAATTCTCC 
TATCCAAGGA 
AAAGAAGAAA 
CATTGCACAG 



CACCTCCTCC 
ATGAANTTTC 
CNTTNGACTA 
GGGTATCCAA 
TCTTTTTGAA 
AGAACTCCTT 
TTCATAATCA 
ATCACCTGCT 
GCGAGCACGT 
CTCGTCCAAT 
TTCACACTCA 
GTTCATGATA 
ATTGGTATCT 
AATGCCAGAA 
GAGACGCTCT 
GGATGCTGAA 
ACCTTTGTCA 
GGTCCAGTTG 
GTCCTCAAAG 
GTAAGTTCCC 
CACCAATCAA 
CAGCAAGAAA 
TTTGACTGTC 
ATTTTCCGAG 
AACATGATAT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1550 



(2) INFORMATION FOR SEQ ID NO: 75: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1139 base pairs 
<B) .TYPE: nucleic acid- 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:75: 



GTGGCGGCCG CTTAAGAATT AGTGGGATCC CGGGGCTGCA GGATTTGAAC TCAATGCATC 60 

CAAGGCGTCA TCCATTGAAA GAACTTCGTG GAAGACACGT TGTGTCAATT CAGTTTTTTG 120 

TTCTGGAGTG AGGTATTTAG TGTTTACACA GTATCCAGAG ATACGTACGA TAACGTCTTC 180 
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GCCTGACATG ATTTTTTCGT AAACATCGTT 
TCCACCGTTT TCGAAGTAAC CATCAAGGAT 
AGTCTTACCA AGAGCGCGAG GTGATACTTG 
ACTAAAGTCA AGGCTAGAAA GTGAGTTCAA 
AGATGGGTTA GCACCTGGTG AGAAGAATTC 
GTTGAGGTAT ACACCTTTGT GAACTGGTGA 
AGATGTGATT GTCAAAAGTG ATACTGTAGC 
TAAACGAGTT GTGTAAGCTT CGATTAACCA 
TTCACCCCAG CGTGGGTAGT CACCGATTGT 
ACGGATTGGT TTAACTGTAG CGTATTTGAT 
TCCACAGATA CCGAATCCCA TGTTGGCACG 
AGCTTCGTAG TTGTACCTAT CAGTCATGTT 
TGTCAGTCAA CCAGTCAAGA GATTTTTCAA 
CTTCGTCACG GATTGGTTCG ATATCAAATA 
CACCATTCAA ACCAGTAAGA AGGGCTTTAA 
TGTTGTGGCG TTGTTCTTCA TTTTCTGGAT 

(2) INFORMATION FOR SEQ 
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CAAGTCCATA ACGTTCAAGT TAACGTGTTG 240 

TGTTACCAAG TTATCAACTT GTTCATCACG 300 

TGTAGTCAAT GAGATACCGT CAGCTGCATA 360 

GTTTTGCAAC CAACCACCTT TAGCTTTGTT 420 

AAGTTTAGAC AAGTTCACAG AACCATCTTC 480 

GTTACCAGTT TGTTTAGAGT AAGCAACGTT 540 

TTCTGCGTCT TTGTATAGTT TGTGGCTACG 600 

TTCTGCCAAT TCGTTTGAAC GTGGGTCATC 660 

TTCGTAATCG TAGATGTAGC CATCTTCGTC 720 

AGCTGACAAT GTATCAACAG TGTTAGCAAA 780 

TTGTTTAGTT GGCAAGAAGG CCATTTGAAC 840 

AGTGGATGAT GTTCAAGGCA TCTACGTAAG 900 

AGTTCGCTTT AACTGATTCA AATTCAAGAA 960 

CTTTGTAGTC TTTGTGAACA TCGTCGTAAC 1020 

GAACGTTTAC ACGAGCACCG AAGTACTGGA 1080 

CAAGTGGAGA CACACAGCAT GAGATACAG 1139 

ID NO:76: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1326 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76: 



CAAAACCAGT 


CTTTCCTTGG 


CTGAGAAAAA 


CAGCTGTTAA 


GGCTCCAACC 


AAGTCTCCTG 


60 


TCCCTGTTAT 


CCAGTCTAAT 


TCAGTACAGC 


CATTCTCAAG 


TACAGCAACT 


TGATTCTCCG 


120 


AAACAATAAG 


GTCCTTGGGA 


CCTGTGACTA 


AGAATGACAT 


ACCAGGATAG 


GTCTGACACC 


180 


AGTCTTTCAA 


QACTTGAAGC 


AAATCCTCCG 


TTTCTTGATC 


TTTAGCACTC 


GCATCGACCC 


240 


CAACGCCGTG 


ATGCTTTAAT 


CCAACAAGAC 


TTCGAATTTC 


TGACATGTTT 


CCTTTAAGGA 


300 


CCGTAGGTCT 


ATAGTCTAAA 


AGGTCTTTAA 


CTAAGCTCTT 


ACGAATGGAT 


GAAGTCGTTA 


360 


CGCCAACCGC 


ATCTACTACC 


ATCGGGAGAG 


AAGATTGGTT 


TGCATACAAA 


GCTGCCATGC 


420 


GGATTGCTTT 


TTCCTT 7TC A 


GCTGACAAAT 


GCCCC^AATT 


GATGAAGAGA GCCTGGCTTT 


480 


GCTTAGTAAA 


ATCAAC V*CT 


TCACGGGGAT 


CATC? jCCAT 


GACAGGTTTG 


CATCCCAGAG 


540 


CCAAAATCCC 


ATTTGCCAGC 


ATCTCACAAG 


AAATCTCATT 


GGTCATACAG 


TGAATGAGGG 


600 


AACTAGAGCC 


TATAGGAAAA 


GGATTTGTCA 


ATGCCTGCAT 


CATTCTATCC 


TTTCAGCAAA 


660 


GAAATATCCT 


TGCACTTTTT 


TAAAGAATTC 


CTGCTTGATT 


AAAAATCTAA ATGCAATAAA 


720 


GGAAATCGCT 


GTACCAATCA 


AGGTTGCTCC 


GAAAAATCGA 


GGCGTGTAGA 


TAAACCAACT 


780 


AAGCTTAGCA 


GCCGATCCTG 


TAAAGAGCAC 


CATAACAGGA 


TAGGAAACAA 


TAGAACCAAT 


840 
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AATACCTGTT CCCACAATTT CTCCCAAGGC 
AAAGAGACCT GCTAGAAGGG CTCCAAAAGT 
GAATACCCTT GAGTCGTCAT ACGGATAAAG 
ACAGGTCCCA TCATGATTCC CGCTAGAATA 
TCAATCCGAA AGATAGGTGT AAGGACTACA 
AATTTGTGAA CTTGTAGTTG GTGCTTTCTC 
CTGTAAATCG CTCTTCCATG TCTGGTGTTG 
ATGAACACTG ATGTGGAAGG CATCTAGCAT 
ATAGAG 

(2) INFORMATION FOR SEQ 




PCT/US97/07950 

AGAAAAGTAA AATTTTCGAC CGTACTTATA 900 

CGCTCCTGTG AGAGATAAAG GAGCTTATCG 960 

GCTGTCACTG TAGCCATAGC CAAGGCATAA 1020 

TTGACTACAC TGGACATCGG TGCCATTCCC 1080 

TCAAGGGCAA TCATCATAGA TAAAATGGTC 1140 

AAGTTTCTAT TCTTCTCCTT TTTCTAAAGA 1200 

GTAAGCCATC TCCCAAAACT TGGCTTCCAT 1260 

TTTTTGCTTA TCTGTCTCAT CACTTTCTCG 1320 

1326 

ID NO: 77: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1056 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77: 



CTAGTCCAGA GGGTGCAATG GCTAGTGATG GTACTTGGCA AGGAAAACAG TATCTGAAAG 60 

AAGATGGCAG TCAAGCAGCG AATGAGTGGG TTTTTGATAC TCATTATCAA TCTTGGTTCT 120 

ATATAAAAGC AGATGCTAAC TATGCTGAAA ATGAATGGCT AAAGCAAGGT GACGACTATT 180 

TTTACCTCAA ATCTGGTGGC TATATGGCCA AATCAGAATG GGTAGAAGAC AAGGGAGCCT 240 

TTTATTATCT TGACCAAGAT GGAAAGATGA AACGAAATGC TTGGGTAGGA ACTTCCTATG 300 

TTGGTGCAAC AGGTGCCAAA GTAATAGAAG ACTGGGTCTA TGATTCTCAA TACGATGCTT 360 

GGTTTTATAT CAAAGCAGAT GGACAGCACG CAGAGAAAGA ATGGCTCCAA ATTAAAGGGA 420 

AGGACTATTA TTTCAAATCC GGTGGTTATC TACTGACAAG TCAGTGGATT AATCAAGCTT 480 

ATGTGAATGC TAGTGGTGCC AAAGTACAGC AAGGTTGGCT TTTTGACAAA CAATACCAAT 540 

CTTGGTTTTA CATCAAAGAA AATGGAAACT ATGCTGAAAA AGAATGGGTC TACGATTCTC 600 

GAGAATAGTC AAQCCTAGTA CTACTTCTAA AATCCGGTGG CTACATGACA GCCAATGAAT 660 

GGATTTGGGA TAAGGAATCT TGGTTTTACC TCAAATCTGA TGGCAAAATA GCTGAAAAAG 720 

AATGGGTCTA CGATTCTCAT AGTCAAGCTT GGTACTACTT CAAATCTGGT GGCTACATGG 780 

CGAAAAATGA GACAGTAGAT GGTTATCAGC TTGAAAGCGA TGGTAAATGG CTTGGAGGAA 840 

AAGCTACAAA TGAAAATGCT GCTTACTATC AAGTAGTGCC TGTTACAGCC AATGTTTATG 900 

ATTCAGATGG TGGAAAAGCT TTCCCTATAT ATCGCAAGGT AGTGTCCTAT GGCTAGATAA 960 

GGATACNAAA AAGTGATGAC NAGCGCTTGG CTATTACTAT TTCTGGTTTG TCAGGCTATA 1020 
TGAAAACAGA AGATTTACAA GCGCTAGATG CTAGTA 



1056 



(2) INFORMATION FOR SEQ ID NO: 78: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1426 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78: 

CTGGACCACA CTATTTTCTG TGTTGGCTAT CGTGTCATGC AAAAGGATCT AGAAGGGACG 
CTGGATGCTG AAAAACTCAA GGCTGCTGGT GTTCCGTTCG GCCCGCTTTT TGGTAAAATC 

AAAAACGGCC AGGATCTTGT TTTGGAAGAC GGAACTGAAA TCAAGGCAGC AGACTATATC iou 

TCAGCGCCAC GTCCAGGTAA GATTATCACT ATTTTAGGAG ACACTCGAAA AACGGATGCC 240 

AGTGTGCGTC TGGCTGTCAA TGCAGATGTC CTAGTTCATG AGTCCACTTA TGGCAAGGGT 300 

GATGAAAAAA TTGCTCGTAA CCATGGTCAC TCAACTAATA TGCAAGCTGC ACAAGTAGCG 360 

GTAGAAGCAG GTGCCAAACG CCTCCTACTC AACCATATCA GTGCCCGTTT CCTCTCAAAA 420 

GATATAAGCA AACTCAAGAA GGACGCTGCC ACAATTTTTG AAAATGTCCA TGTGGTCAAA 480 

GACTTGGAAG AAGTGGAAAT CTAGCAGTCA CAGAAAGGAT AAGTATGCCT ACTATTCTCA 540 

TTACCGGAGC TAGCGGTGGT CTAGCTCAAG AAATGGTCAA ACTCTTGCCC AATGACCAAC 600 

TCATCTTGCT TGGTAGAAAT AAGGAAAAAT TAGCCCAACT CTACGGAAAT TATTCCCATG 660 

CAGAATTGAT TGAAATTGAT ATTACCGACG ATTCAGCCCT AGAAGCTCTG GTAACTGATC 720 

TTTATCTCCG CTATGGCAAG ATTGATGTCT TGATTAACAA CGCTGGTTAC GGGATTTTTG 780 

AGGGATTTGA CCAGATTGCT GATAAAGATA TTCACCAGAT GTTTGAGGTC AATACCTTTG 840 

CCCTGATGAA TCTGTCTCGT CACCTTGCGG CTCGTATGAA GGAAAGCAGC AAAGGGCATA 900 

TCATCAACAT CGTCAGCATG GCAGGTCTAA TAGCTACTGG CAAGTCTAGT CTTTACTCAG 960 

CGACCAAGTT TGCGGCTATT GGTTTTTCAA ATGCTCTGCG ACTCGAACTT ATGCCCTATG 1020 

GAGTCTATGT GACAACAGTC AATCCAGGTC CAATCCGAAC AGGATTTTTT GACCAAGCTG 1080 

ACCCAGATGG AACTTATCTT AAATCGGTTG ACCGCTTCCT CTTAGAGGCA GATGCAGTGG 1140 

CTAAAAAGAT TGTCAAGATT ATAGGCAAAA ATAAACGAGA ACTCAATCTC CCGATTTTGT 1200 

TGAACCTAGC CCATAAGTTT TATACTCTCT TTCCCAAGCT AGCTGATAAG TTGGCAGGGG 1260 

AAACTTTTAA TTATAAGTAA AAAGAACCAA TGTGCAGGTT GTTGCTAGCC TACATATTGG 1320 

TTCTTATCAT TTCTCAACTA TCAAACTGAA CTTCTTCTAG TAGATAGTCG ATAAAGCGTT 1380 
CGCCCATCTT AGATAGGCTG GTTTTCTCAT GCTGGATATA GACCAG 



(2) INFORMATION FOR SEQ ID NO: 79: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1702 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79: 

CCAAGCTTGC CCAATTGTCG GCAGGGTTTC GATGGACCAA ATCACTATTC GATTGCCTAA 60 

GCTTTATCCG CTAGGAACCA AGGTAACCTT GATTGGCTCC AATGGGGATA AGGAAATCAC 120 

TGCAACTCAG GTAGCGAACT ACCGCGTAAC CATTAACTAT GAGGTGGTTT GCCTCCTCAG 180 

CGACCGTATT CCGAGAGAAT ATTATTAGAA AAGAAAGGAG TGGAGCATGA ATCTACATCA 240 

ACCCTTGCAT GTCTTGCCTG GTGTGGGACC AAAGTCAGCA GAAAAATAAC GCCAAACTAG 300 

GAATTGAAAA CTTGCAAGAT CTCTTGCTCT ACTTTCCTTT CCGTTATGAA GACTTCAAAA 360 

CCAAGCAGGT GCTGGAACTA GAAGACGGTG AGAAGGCAGT TCTTTCTGGT CAGGTAGTGA 420 

CTCCTGCTAG TGTCCAGTAT TATGGTTTCA AGCGCAATCG CCTGCGTTTT AGTCTCAAGC 480 

AGGGAGAGGT CGTTTTTGCG GTGAATTTCT TTAACCAGCC CTATCTGGCT GATAAAATAG 540 

AGTTGGGAGC AACCCTTGCT GTCTTTGGAA AATGGGACCG CGCTAAGGGC TAGTCTGACT 600 

GGGATGAAGG TTCTGGCTCA GGTAGAAGAT GACCTCCAGC CTGTCTATCG TCTGGCTCAG 660 

GGAATCAGTC AGGCCAGTCT GGTCAAGGTC ATCAAGACGG CTTTTGATCA GGGACTGGAC 720 

CTCTTGATAG AAGAAAATCT GCCCCAGTCT TTACTAGACA AATACAAACT CATGTCCCGT 780 

TGTCAGGCAG TCCGAGCTAT GCATTTTCCA AAGGATTTGG CAGAAAACAA GCAGGCTCTT 840 

CGCCGTATCA AGTTTGAGGA ACTCTTTTAT TTCCAAATGC AGCTGCAGAC GCTCAAGTCT 900 

GAAAATAGAG TTCAGGGAAG TGGTCTGGTT CTGGATTGGT CTCAGGAAAA AGTGACAGCA 960 

GTTAAAGCAA GTCTTCCTTT TGCCCTGACC CAAGCTCAGG AAAAGAGTTT GCAGGAAATT 1020 

TTAACTGATA TGAAGTCCGA CCACCACATG AATCGTCTCC TACAAGGAGA TGTGGGGAGC 1080 

GGAAAAACGG TAGTCGCTGG CTTGGCCATG TTTGCGGCAG TGACAGCAGG TTATCAGGCT 1140 

GCCCTAATGG TACCAACAGA AATCCTCGCA GAGCAACACT TTGAGAGTTT ACAGAACCTT 1200 

TTTCCCAATT TGAAACTGGC TCTCTTGACA GGTTCCTTGA AAGCTGCAGA AAAGAGAGAA 1260 

GTCTTGGAGA CCATTGCCAA GGGTGAGGCT GATTTGATTA TAGGAACTCA CGCTCTGATA 1320 

CAAGATGGGG TGGAGTATGC TCGTCTTGGT TTGATTATTA TCGATGAGCA GCACCGTTTT 1380 

GGTGTAGGGC AAAGGCGTAT TTTACGGGAA AAAGGCGACA ATCCAGATGT CCTCATGATG 1440 

ACGGCGACTC CCATTCCACG GACGCTTGCC ATCACAGCCT TTGGAGATAT GGATGTTTCC 1500 

ATTATCGACC AGATGCCAGC AGGTCGGAAG CCTATTGTGA CGCGCTGGAT CAAACATGAG 1560 

CAACTACCTC AGGTCTTGAC TTGGTTAGAG GGGGAAATTC AAAAAGGTTC CCAAGTCTAT 1620 

GTCATCTCTC CTTTGATTGA AGAATCACAA GCTCCTAGAT TTGAAAAAAT GCCATTGCCT 1680 
TATCAGAGGA GTTGACTACT CC 



(2) INFORMATION FOR SEQ ID NO: 80: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 663 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



1702 



(ii) MOLECULE TYPE: Genomic DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80: 



60 



CTATGAAGGA CCTGGTTCAG AAGTTGGCAA GTAAAGAAGA ATTGTCGACA GCAGACAATG 

AAATGATTGA ATTATTGGCT CGTTTTAATG AACGCCAAGC CTCCTTTTTT GGTCAGTTTT 120 

CGGTTAGAGG CTATGTTAAC TACGATAAGC ATGTAGCTAA GTATTTAAAA ATCTTGCCAG 180 

ACCAGTTTTC TTATCAAGCA ATAGAAGATG TTGTCAAAGC AGATGCTGAA AAAAATACTT 240 

CCAATAATGA GATGGGAATG GAAAATTATT TCTATAATGA GCAGATCAAG AAGGATTTGA 300 

AGAAATTAAA GGATTCTCAG AAAAGCTTTA CCTATCTCAA GTCGCCAGAG TATAATGACT 360 

TGCAGTTGGT TTTAACACAG TTTTCTAAAT CTAAGGTAAA CCCGATTTTT ATCATTCCAC 420 

CTGTTAATAA AAAATGGATG QACTATGCTG GTCTACGAGA GGATATGTAC CAACAAACGG 480 

TGCAGAAGAT TCGCTACCAG TTAGAAAGTC AAGGTTTTAC CAATATAGCA AATTTTTCTA 540 

AGGACGGTGG GGAGCCTTTC TTTATGAAGG ACACCATTCA CCTTGGTTGG TTGGGGTTGG 600 

TTGGCTTTTG ACAAGGCAGT TGATCCTTTC CTATCCATCC CACACCAGAT CGAAGTCCTG 660 
CAG 



663 



(2) INFORMATION FOR SEQ ID NO: 81: 



U) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1365 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81: 



CTTGTCTTTT GATTTTTGGA GGTGTTGATG AAAACATTTC TTGTCAAACA AAAGTTTCGT 60 

CTTGGAGGCG AACGCTTCGC TATCAAGGAT GACAGGGGAG AAATTGCCTA TCAGGTGGAG 120 

GGATCATTTT CTTAAGATTC CCAAAACTTT TACCATCTAT GATGCGGCTG GTGAACAAGT 180 

CAGTCAGATC AGTAAAGAAA TCTTGACCTT GCCTCCTCGT TTTGGAGATT CAGCTTCGGG 240 

ATGGCTCGAG TTTTGTCATT CGTAAGAAGT TGACCTTTTG GCGAGATAAG TATGAGTTTG 300 

ATAATCTAGG TCTTCGTATC GAGGGCAATA TCTGGGATTT GAATTTCAAA TTGCTGGATG 360 

ATCGCGATCA GTTGATTGCG GAAATTAAGA AGGAACTCTT CCATCTGACC TCTACCTATA 420 

CCGTAACGGT TCTTGAAGAC GCTTATGCAG ACCTAGTCAT TTCCCTCTGT GTCGCGATTG 480 

ACTATGTGGA AATACTGGAA AGCCAATCAC ATTAAACAAG TGAATAAGGA GACAATATGA 540 

AACAACTATC TAGTGCACAA GTACGCCAAA TGTGGCTTGA TTTCTGGGCG ACCAAAGGTC 600 

ACTCAGTAGA ACCATCAGTA AGTTTGGTTC CTGTAAATGA CCCAACTCTT TTGTGGATCA 660 

ACTCTGGGGT AGCAACCCTT AAGAAATACT TTGACGGGAC CATTATCCCT GAAAATCCAC 720 

GTATTACCAA TGCCCAAAAG GCTATCCGTA CCAACGACAT CGAAAACGTT AGGGAAAGAC 780 

TGCGCGTCAC CATACCATGT TTGAAATGTT GGGGAACTTC TCTATCGGGG ATTACTTCCG 840 

TGACGAAGCT ATCACTTGGG CTTATGAGCT TTTGACAAGC CCTGAATGGT TTGATTTCCC 900 
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TGCTGAAAAA 
TGAAGTGGGA 
TGCGGGACCT 
AGAAAATATC 
CTGGAACATC 
GGAATTGCCA 
CCAAGGGGCT 
GAAATTGTCT 



CTTTACATGA 
GTGGATCCAA 
TCTGGACCAG 
GGTCTTCGCC 
GTTTTGTCAC 
CATAAGAACA 
AAGACCAACT 
GGTAAGGTTT 



CCTACTATCC 
GTCACTTGAT 
ATACAGAAAT 
TGCTTGCAGA 
AATTTAACGC 
TTGATACGGG 
TTGAAACGGA 
ATGACCAAGA 



AGACGATAAA 
TCCAATTGAG 
CTTCTTTGAC 
AGATATTGAA 
AGACCCTGCT 
CGCTGGTTTG 
CCTCTTCATG 
TGGCGACAAC 



GATTCTTACA 
GACAACTTCT 
CGTGGGGAAG 
AACGACCGTT 
GTTCCTCGTA 
GAGCGTTTGG 
CCGATTATCC 
ATGAG 



ACCGCTGGAT 
GGGAAATCGG 
CCTTTGACCC 
ATATTGAAAT 
GCGAATACAA 
TGGCCGTTAT 
GTGAAGTCGA 



(2) INFORMATION FOR SEQ ID NO: 82: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1322 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



960 
1020 
1080 
1140 
1200 
1260 
1320 
1365 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82: 



CTCCAGGTAA 
ACCCACGTTT 
GTGCCATCGA 
CAGGTAAAAC 
AAAAAATGCG 
AAGATTTGTT 
TTTTAAACAT 
CTTGAAGGCA 
GGGCTTGTAC 
CCTTGGTTGG 
TGCTGCCATT 
AGCGGTTGCT 
TTCAGTTGGT 
GGAGCGTGCG 
TCTTCTCCCT 
GGCTCAAAGA 
TTATCAACAT 
CTGCTGTGTC 
TCTACCTTCA 
ACCCAATCGG 
TGAAAAACTT 
ACAAGGGTCT 



TGTCAAGGCT 

TGGAGAAACA 

AGGCGGCGTG 

ATTGGTCAAT 

TGACTTGGGT 

TGACTTGATT 

GTCTATTATT 

TCCTCGACCA 

AGGTCCTTGG 

TCAAATATCG 

ATCATGGTTC 

ATCC.CTCTTG 

TTGGTTCATA 

CATTTCATTG 

TATGGTACCA 

TGGTATGGCT 

GATGGCAACT 

AGATATTACT 

CCTTTCTAAA 

CGATATCCTA 

CAATTAACTA 

TGGAACTTTG 



AACGTGGTTC CAATTCAAAA 
CATGCCCTTA TCTTGTTTGA 
CCAATCAAGA CTCTTAACGT 
ACCGTTTTGT CTATGGACAA 
GTTGAATTTG ATGTCCGTAA 
AACAAAGCCA ATGTCAAATA 
TCTATGGTTT TAGTAGTCGT 
GTTCCAATTT CACCAACCAC 
AAGCAGGGAT TATCCTCGGT 
GTGCTGCTAT CGCTCCTGAT 
TTGGTGGTGA CTTTACCAAG 
CCGTAGCTGG ACTTTTCTTG 
CTGCAGATGC TGCCGCTAAA 
CGCTACTTTT CCAAGGACTT 
ACTGAAACTG TACAAAGTAT 
ATCGGTGGTG GTATGGTCGT 
CGTGAAGTAT GGCCATTCTT 
CTAATCGGAT TCGGTGCTAT 
ACTGGTGGAA ATGGTGGCGG 
GAAGACTACT AAGATAAGAA 
AATCAGATCG TAAAAAAGTT 
AACGGATGCA AAACTTGGGC 
147 



ACTGATTGAT 
AACACCTCAA 
TGGTTCTATG 
AGAAGATGTT 
AGTACCAAAT 
AGCCATTATT 
TGTAGCCTTC 
TTGTAGCCTG 
GGATCGCTTC 
GCTGCACTTG 
ACTGGTATCG 
ACAATGATTG 
AAAGGTGACT 
CGTATCGCGC 
CCTTAGTGCC 
TGCCGTTGGT 
CGCTCTTGGT 
CGGTGTTGCT 
AGGAGCCGCA 
AGGACTGAAA 
TGGTGGCGTT 
TGGGCTTATA 



ATTTCAAAAG 

GATGCCCTTC 

GCTCACTCAA 

GCTACATTTG 

GATTCTAAAA 

TATGAAAGGA 

CTTTGCAGGT 

TACCCTTATT 

AAATGATTGC 

CTTCTGTCGC 

GTGTTGCCCA 

TTCGTACAAT 

TCGGCGCTGT 

TTCCTGCAGC 

ATGCCAGACT 

TACGCCATGG 

TTCGTTCTCG 

ATCGCTCTTA 

ACTTCTAACG 

ACATCATGAC 

CAACCTTCTT 

CACTCATTCC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
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AG 



1322 



(2) INFORMATION FOR SEQ ID NO: 83: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 745 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83: 

CTGGGATAAC CAAGCGAATC ACAGAACCGT CATTAGCCGG TCTGATACCA AGATCAGAAG 60 

CGTTCAAGGC ACGTTCGATG TCTTTCAATG AAGACTTGTC AAATGGTGTT ACCAACAAAA 120 

CACGCGCTTC TGGAATCGTA ATTGAAGCGA TTTGGTTAAG AGGAGTTTCG ACTCCATAGT 180 

ATTCTACATG TACACGGTCA AGCAAGCTTG CATTGGCACG ACCAGCACGG ATACCACCAA 240 

ATTCACGAGC AAGTGATTGG TGAGACTGGG TCATTCTCTC TTTAGCTTTT TCAATAATTA 300 

CGTTAGCCAT ATTCTTTCTT ATTCCTTTTC TTCGATATTA TTTGAAACTG TTGTTCCGAT 360 

ATTTTCACCA AATACGACAC GTTTGATGTT GCCTGATTGG TTCATGTTGA AGACAACCAA 420 

GTCAATGTCG TTGTCCATTG AGAGGGTTGA GGCTGTTGAG TCCATGATAC GAAGACCTTT 480 

GTTGATAACA TCACGGTGGG TCAATTCTTC AAACTTAACG GCTGTCTTGT CCTTCTTAGG 540 

ATCGGCATTG TACACACCAT CGACGCCATT TTTAGCCATG AGGATGGCAT CTGCTTCGAT 600 

TTCAGCTGCA CGAAGGCCGC TGTTGTATCT GTCGAGAAGT ATGGTGAACC AATTCCAGCA 660 

CCAAAGATAA CGATACGGCC TTTTTCAAGG TGACGAAGGG CACGTCCACG GACATAAGGC 720 

TCTGCCACTT GTTGCATAGC AATAG 745 

(2) INFORMATION FOR SEQ ID NO: 84: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 636 base pairs 

(B) TYPE: nucleic acid- 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84: 

CTGATAAATC TATATCCAGA TTTAGATTTC ATTGTGACGA CGGTAGCTTT GCAGGAACCA 60 

GCAAGTGTCC CGTTTGTCCT AGTTAGTGCT TTTCTAACCG AGGGTGATAA ACAACGTCTT 120 

CAAGCAAAAA TTCAGGAGAT AAACTATGAA TAATCTTTCG CTTGTCCTTA TGGATATATC 180 
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TGTTCAAAAT CGTCAAGAAG CCTACAAAGA ATTAGCAAAT CAAATCAGCC TTCTTGTTTC 240 

TGAAGATACA GAAAAAATAG AAGAGCTTCT ATATTACCGT GAGAGACAGG GAAGTATAGA 300 

GGTTGCTAAA GGTGTTCTTC TACCACATTG TGAAGGAAAC TTTCAACATC ATGTCTTAGT 360 

GATTACTAGA TTAAAACCAC CTATCAGAGA ATGGTCGAAG GATATCCAGT GTGTTGACCT 420 

TATTATCGGT TTGGCCATTG CAGTATCACA GGACAAGTCA TGTATTAAAA CATTGATGAG 480 

AAGACTAGCA GATGAATCAT TCATAAATCA ATTAAAACAG TTAACAAAAG AAGAATTACG 540 

GGAGATAATA TATGGAAATC AAAGATATTC TTAATGTGAG TCTGATCCAG ACGGATTTAC 600 

AGATGCAGAG CAAAGAAGAG GTTTTTGAGG CATTAG 636 

(2) INFORMATION FOR SEQ ID NO: 85: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1128 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85: 

CTGGTACAAT CGGTGTGATG GACGCTATCG TGAAACAACC AGGAGTTAAA TCAATCATCG 60 

GTGGTGGTGA CTCAGCTGCC GCAGCGATTA ACCTTGGCCG TGCAGACAAG TTCTCATGGA 120 

TTAGTACGGG TGGTGGAGCA TCAATGGAAC TTCTTGAAGG TAAGGTTCTT CCAGGACTTG 180 

CAGCCTTGAC AGAAAAATAA GATTTTATAA ATAAATCAAA GAAGAGAGGG ATGAAAGTTC 240 

CTCTTTTCTT TTGCTTAAAA TAAAAACGCT TCCTCTCAAC TATTACTCAT AAAAATCACC 300 

GATTTATGAT AAAATGGAAA TAGAAAGTTG AGATTATGAG TTATTTTAAA AAATATAAAT 360 

TCGATAAATC CCAGTTCAAA CTTGGTATGC GAACCTTTAA AACAGGTATT GCTGTTTTTC 420 

TAGTTCTCTT GATTTTTGGC TTTTTTGGCT GGAAAGGTCT TCAAATTGGT GCTTTGACAG 480 

CCGTTTTTAG CCTGAGGGAG AGTTTTGATG AGAGTGTTCA TTTTGGGACT TCGCGTATTC 540 

TAGGAAATAG TATCGGTGGA CTCTATGCCT TGGTCTTCTT CTTATTAAAT ACCTTTTTCC 600 

ACGAAGCCTT TTGGGTGACC TTGGTAGTTG TTCCAATCTG CACCATGTTA ACCATTATGA 660 

CAAATGTAGC CATGAATAAC AAAGCAGGGG TTATTGGTGG TGTAGCAGCT ATGTTAATCA 720 

TTACCCTATC AATTCCAAGT GGTGAGACAA TTTTGTACGT GTTTGTGCGT GTATTAGAAA 780 

CGTTTATGGG AGTTTTTGTC GCAATTATCG TAAATTACAA TATTGATCGT ATTCGTCTCT 840 

TTTTAGAGAA AAAAGAAAAA TAATGTTACA TTTTATAACA TTATCAATTG ACGTTTGTCT 900 

TTTTTTAGAC TATAACAGAC AGAAAGAAGG AAATTGTAAA TGAAGGAAAA AGAATTTCGC 960 

CGAAATATGG CTGTTTTTCC TATCGGCAGT GTTATGAAGT TGACCGATCT ATCGGCGCGT 1020 

CAGATTCGTT ATTATGAAGA TCAAGAGTTG ATCAAGCCCG ATCGAAACGA AGGGAAATCG 1080 

TCGCATGTAT TCCTTGAAAT GACATGGATC GTCTGCTTGA AAATCATT 1128 

(2) INFORMATION FOR SEQ ID NO: 86: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1249 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86: 

AGGCAGTTTC AGACTAATAT CCAAGTCGTA AGAAATGCCT GAAATAAGCT TTTCTAAATT 
GTCCAAAGCT TGCGGAAAAC GCTCTTGGAA TAGTTTCTCT AAAGAACTTG CTGATATAAA 
ACATCTTGTC TCCAACGCAA CTGTATTTCA TCTGTCACGA CGGGAACACG GGTATAGGTC 
ATCTGACTGA GCAAGAGGGT CGCATGATCC GCATTGTGAG TATCAATCAA CACAACTAGA 
TTTTTAACAG GGGTCAAAAA AGTTTCCTCC TGCCCCAACA AGAAAGTCTC AAACTCCTTG 
GCAATCATCG GCTAAACTCC TTGGACAAAC CTGGATACAC CTCGTGATCT CGTGTCAAAA 
AGTCCACTTT GAAGTAACTA TCATCAATCT CCACACGAGC ATAGAGACAT TCTCTGATGG 
TACCTCGTGG TTGACTGATA GAGCCTGGAT TTAGAAAGAG GATCTTGCCT TCCAACCAAG 
CACTTGGCAC ATGCAAGTGA CCATAGAGGC AGATAGCGGC CTCTTCCTCC TGAGCCCAGT 
AGTCCAACTT TTGAAAGTTG AAATTGATGT CAAACAAGTG ACCATGAGTT TGGATAATCT 
TGGTCGAACC AAGCTCAGTC ACCAGACGTT CTGGGTAGCC GGCGTAGAAG TCCATGTTCC 
CTTTAACAAC GCGGATGCCC TCCCAAAGTG GAGAATCCGG ACGTAGTTCA GAATCGCCGT 
TATGAAAAAC GGCATCGACT TTGCCCACAT AGCGATCACG GACTTCTTCC ACAATCAAGC 
TATCGCCATG GGAATCGCTC ATTACAATGA TGGTTTGCTT TGCCATGATG GAAATACCTC 
CAAAAGTTTC TTAACGGCTA AGGCACGGTG AGATTGACTA TTTTTTTCTT CCAGGGTTAA 
TTCAGCTGAT GACTCACCTG TCTCTCCTAC AAGGAAGAGG GGATCATAGC CAAAGCCATT 
TTCACCCTTA GGTTCAAAGT TAATATAACC TGACCAGTCT GCTTCAACAA CTAAACTTTC 
CTTATTTGGG CTGGCTACGA CTAGGGTTGT GTGGAACTGA GCCGAGCGGT CCTTGAGTTC 
AAAGACCATG GCCAATTCGT GCAAGAGTTT GGCATTATTT TCACGGTCAG TAGCTCCCAC 
ACCTGCGAAA CGAGCTGACC AGACGCCTGG TAAGCCACCA AGGACNTCGA CTTNAGACNA 
GATTCACTGC CAAAACCATC TTGNCCGTCA ATGAGAANTG GTTTCTGCC 

(2 1 INFORMATION FOR SEQ ID NO: 87: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1390 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87: 
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CTAGACCATT TAACAGTGAT AGGGTTAAGA CGAGTGACAT CACCCCAGAT GATTGGTGGT 60 

TTTACCCCAC GGCATACCGT ATGGATTGTA AACCCAATCC AATTTTTAAG GAGGAAGGAG 120 

GTATCCTGAA CAAGTTTTGG ACCGGAAGTA ACTTCAAACC AATGTCATTA ACGGCTCAAA 180 

TTCAACCGTG GAAACAAGGA CATCAAAGTC AATATCTTCT TGCCAACTTG AATCCATTCG 240 

TCAATCGTTT CAGCAAGGGA AAGCATCGTA CTCTTTTTGA GACAATTCAC CTTTACGGTA 300 

AGCCAAACGT TTGGCACGAA CTTCTTTTGT TTGAGGGAAT GAACCAATCC TTGTT G TTGG 360 

AAAAGCTGGA AATTTGAAAC CTTCTCCTTG GATAGCTTCA CGTTCTGCAA AGGCTGGCAA 420 

ACGAGTGTAG TCTGCGTCTG TCAAGCCAGC GATACGCGCA CGAAGTTCAG CATTTTCACC 480 

AACACGCTCA GTCGCAAAGA GTTCTTTGTT GGCTGCAAGA GCTTCTGAAC CTTGACCATT 540 

GCGGATAGCA TCCAAGTCAC GGATTTCATC CAATTTTTCA ACTGCAAAGG CAAAGTGGTT 600 

CAAGAGTGCT GGTTCAAATT CTTCATTAGC AGTTGTAAAT GGCACATGAA GAAGTGAGCA 660 

AGAGCTTGTC AAAACGATGT TTTCAGCTGG GATTTGCTCA AGAACAGCCA AGCTCTTTTC 720 

GTAGTTGTTG CGCCAGATGT TTTTACCATT GACAATACCT ACATAGAGAG TCTTGTCAGC 780 

TGGGAAGCCA CCTTTAACGA GTTCAAGAGT TTTCTTACCT TCAACAAAGT CAAGACCGAT 840 

AGCATCTACT GGTAAGTTTA CAAGGTCAGC GTATACGTCA CGAACATCAC CGAAATAAGT 900 

TTGAAGCAAG ACTTCAAGAC CTTTTTTGTC AGCCAAGAGT TTGTTGTAAA GGTTCAAGAA 960 

AAGAGCTTTT TCTTCAGCTG TCAAGTCTTT TACAAGAGCC GCTTCATCCA ATTGGATGCG 1020 

AGTCGCACCA AGTTCAGCCA ATTTAGCAAA AACTTCTTGG TAAGCAGCCA CTAAGCTATC 1080 

TACGAAGTCG TCTGCTTTCA CGCCTTCTTC AAAGTCTGAC AATTGAAGGA AAGTGAAGGG 1140 

ACCTACAAGA ACAGGACGAG TGTTCAATCC AAGTTCTTTG GCTTCTTGGA ACTCATCGAA 1200 

AATCTTGTGA CCAGCCAATT TTACTTGAGT GTCTTTTTCA AATTTAGGAA CGATGTAGTG 1260 

GTAGTTAGTG TTGAACCATT TCTTCATTGG AAGGGCGCGA ACGTCCCCTT TTCCTCCCTG 1320 

GTAACCACGT CCCAAAGCGA AGTAGCGCTC AAGGTCAGAC AAGTCCAAGT TTTGAACTGA 1380 

GGCAGGCACT ACGTTGAAAA GGAAAGCCGC ATCTAGGAAG TTATCATAGT GAGAAAAGTC 1440 

ATTTGATGGG ATTTCAGTGA TGCCTTTCCC TTTGACAATG TTCCAGTGTT TACCACGCAA 1500 

GTCTTTTGCT GCTGCTAAAA GTTCTTCTTC TGAGATTTCT TTTCTAAAGT ATTTTTCAGT 1560 

TGTAAATTTT AATTCACGGA ATTCGCCCAA ACGAGGGAAA CCGATGATTG TAGTTGACAT 1620 

GATGTGTCCT CCAAAATTTG TTGTTGAAAC TATCTTAACA GAAAAGAAAG CGTCTGTATA 1680 

ATTGTAAAAA ATTAGGGTTT GATATAGTTT GAAACTATAT ATCTGTTTCG GACAAAAGAA 1740 

AAAGACTTGA AGCAAACGTC TCAAATCCTT TGTAATTCTT ACTTTACAGC TATATTCCAA 1800 

TTAGAATACT AAAACATGTT ATTAGTAATT CTTATAAGTG ACTATGACCT GTTATTAGAA 1860 

AAGACTATAA CTGATTCTAG TCAACTTTTT CCCTGTTCAA GTGGGACGAT TGCTAGTGTC 1920 

TTTCCTAAAC TGGCTAGGAC TTTTAAGACT GTATCCAACT GAGGACTAGT CTTTCCTGTC 1980 

TCCATCCTAG 1990 

(2) INFORMATION FOR SEQ ID NO: 88: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1064 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88: 

CTTTATAATN NACAANNACT TGGATTTGAA CACTGATTAC CCATATAATG AGATTAAAGG 
AGAGGTTAAA GTATGGCCAA GGATGGAAGT ATTCTTCTTC TTCGGAAAGA TTTTGTTAAG 
ATTAATTAAA GGAACCTGGC TTAAAAGGGA GCCAAAAGTT CTTTTATCCA TCCCGAAAAA 
CAAATTTTTG AAGCGGTTAA GGATTTATCG TTTGAGGTTC CAAAGGGACA AATTTTAGGT 
TTTATTGGTG CTAATGGGGC AGGTAAGTCA ACAACGATTA AAATGTTGAC AGGGATTTTA 
AAACCGACAT CTGGTTTTTG TCGCATTAAT GGGAAGATTC CTCAAGAAAA TCGTCAGGAT 
TATGTTAAGG ATATTGGTGT TGTTTTTGGG CAACGCACCC AGCTTTGGTG GGATTTGGCT 
TTACAAGAGA CATACTCAGT TTTAAAAGAA ATTTATGATG TCCCTGATGC AGTATTTCAA 
AAACGCATGG ATTTTTTAAA TGATGTTTTA GACTTGAAAG AGTTTATTAA GGATCCTGTG 
CGGACACTAT CATTAGGTCA ACGTATGAGA GCGGATATTG CGGCCTCCTT GCTCCACAAT 
CCCAAAGTTC TTTTTTTAGA TGAGCCGACC ATTGGTTTGG ACGTTTCGGT TAAGGATAAT 
ATTCGTCGGG CAATTACTCA GATCAATCAA GAGGAAGAAA CTACCATTCT TTTGACCACT 
CACGATTTGA GTGATATTGA GCAATTTGTG ATCGGATTTT CATGATTGAC AAGGGGCAAG 
AGATTTTTGA TGGAACAGTG AGCCAACTCA AGGAGACCTT TGGTAAGATG AAGACTCTCT 
CTTTTGAACT GCTACCAGGT CAAAGTCATC TCGTCTCTCA CTATGAAGGT CTGTCTGATA 
TGACCATTGA TAGACAAGGA AACAGCCTCA ACATTGAATT TGATAGTTCT CGCTACCAGT 
CAGCTGACAT TATCAAGCAA ACCCTGTCTG ATTTTGAAAT CCGCGATTTG AAGATGGTGG 
ATACGGATAT TGAGGATATT ATCCGTCGCT TCTACCGAAA GGAG 

(2) INFORMATION FOR SEQ ID NO: 89: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1371 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89: 

CTCTAGTGCC TGGTTTGGTA GTGATTTTGG TTGGGGATAA TCCAGCCAGC CAAGTCTAAC 
GTTCGCAACA AGGAGAGGTC AGCTCTTGCG GCTGGTTCCC GTACCGAAGT AGTGCGAGTT 
CCAGAGACCA TTACTCAAGA GGAATTGTTA GACCTGATTG CTAAATATAA TCAGGATCCA 
GCTTGGCATG GGATTTTGGT CCAGTTCCCA TTACCAAAAC ATATCGATGA AGAGGCGGTT 
TTATTAGCCA TTGACCCAGA AAAGGATGTG GATGGTTTCC ATCCCCTAAA CATGGGACGT 
CTTTGGTCTG GACATCCAGT CATGATTCCC TCGACACCTG CAGGAATTAT GGAAATGTTT 
CATGAATATG GGATTGACTT GGAAGGTAAA AATGCGGTCG TAATCGGTCG TTCAAATATC 
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GTTGGAAAAC CTATGGCTCA GCTTCTTTTG GCTAAAAATG CGACAGTAAC CTTGGCCCAC 480 

TCACGAACTC ATAATCTTGC CAAGGTGGCT GCTAAAGCAG ATATTCTTGT AGTCGCAATC 540 

GGCCGCGCCA AGTTTGTGAC TGCTGACTTT GTCAAACCAG GTGCGGTTGT CATTGACGTT 600 

GGGATGAACC GAGATGAAAA TGGGAAGCTC TGTGGAGATG TTGATTATGA GGCAGTTGCA 660 

CCACTTGCTA GCCATATCAC GCCAGTCCCT GGAGGTGTCG GTCCTATGAC CATTACCATG 720 

CTGATGGAGC AAACTTATCA AGCAGCACTT CGGACATTGG ATAGAAAATA AGAGAAACAT 780 

TCGATTAATA GAGTGTATTT TCTATAGCTA TATCTAAAAT GGCAGAAATG AATATTAAAT 840 

TTTAGATATA AGATTACAAA AGGAGGTCTG CGTCTCCTTT TTGTTGTATA ATAAAAGTGA 900 

GAGGGAAAAA GGATGAAAGT GATTAATCAA ACCTTACTAG AAAAAGTTAT TATTGAACGT 960 

TCTCGTTCAA GTCATAAAGG AGATTATGGT CGTCTGCTGC TGCTTGGTGG TACTTATCCT 1020 

TATGGAGGTG CCATCATCAT GGCTGCTTTA GCAGCTGTAA AAAGCGGTGC AGGATTGGTG 10 80 

ACCGTTGGAA CGGACAGGGA AAAATATCCC GGCTCTGCAC AGCCATTTAC CTGAGCCTAT 1140 

GGCCTTTTCT CTTCAAGACC AGCAATTGTT AAAAGAGCAA TTGGAGAAGG CAGAAGTTAT 1200 

CTTGCTGGGG CCTGGTTTAC GAGACGATGC TTCTGGAGAA AATCTAGTAA AACAGGTCTT 1260 
TGTTAATTTA AGCCAAAATC AGATTTTGAT TGTAGACGGA GGTGCCTTGA CCATTCTTGC 
TAGGACAAGT TTGTCATTTC CCTCAAGCCA GTCAATCCNG CCNCCCGGGG T 



1320 
1371 



(2) INFORMATION FOR SEQ ID NO: 90: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 635 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90: 

TTTTGGGGGT NTGGCTCCTT NGGTNCTTTA NAAAAGGCCC TANGGCCNTG ATCTGNAATT 
CCCATTNNTG GCTCNTNANC TACCATCCGA TTTNNCANGG NCCCCTNNCT TTCCAATTTN 
NTTGTTNANA ATNTGGTCAT CCTTGGANGT NCACTCGATG ANAGTGATTA ATCAATCCTT 
ATTANANNAA GTTATTATTG ATCGTCCTCG TTCAAGTCTT AAAGGAGATT ATGGTCGTCT 
GCCGCNGCTT GGTGGTANTT ATGGGGGCGG AGGTGNCACC TCCATNGCTG CTTGNTCNGG JUU 
TGNNNNAANC NGTGCTTGAT TGGTGACCGT NGGNNCTNAC CNGNNTTATT TCCCGGCTCT 360 
GCACNTCATT NNCCTGACCC TNCGGCCATT TCTCTTCAAN ACCAGCNATN GTTNAANGAC 420 
CNTNTGGNNA ATGNNGAATT TTNTCTTGCT GGGGCCTGGA TTTACTANNA CGATGCTTCC 480 
NGNANGAAAA TNTNGTTAAA ACATGGTCTT TGNTNAATTN TAATCCCNAA TTCCAAATTN 540 
TGAATGTATA ACNGAGGTGC CNTNGNCCAT CCTTGCNAAG AANAATTTGT TNTTTTCCCC 600 
TCAANNCAAA ATTGAAATNC TGGCNCCCNG GGGGG 

635 

(2) INFORMATION FOR SEQ ID NO: 91: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1505 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DMA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91: 



CTCCTGTGGT AATAGAGAAA AAACTTCCTG AAACAGGAAC TCACGATTCT GCAGGACTAG 60 

TAGTCGCAGG ACTCATGGCC ACACTAGCAG CCTATGGACT CACTAAAAGA AAAGAAGACT 120 

AAGTCTTTTC GATAAAAAAT AAACAGCGAG ATTGAAGCTC GCTGTTTATT TTTTAATTAA 180 

TCACCTAGTC CAAGACGTTC AAAGATATCA TCCACTCGTT TGGTGTAATA AACTGGGTTG 240 

AAGATTTCAT CGATTTCTTC TTGTGTGAGA CGTGATGTTA CTTCTGAATC TGCCTCAAGA 300 

AGTGGTTTAA AGTCTACTTG GTTGTCCCAA GAGTAGGCTG TTTTTGGTTG CACCAAGTCA 360 

TAGGCTTGCT CACGGGTCAT GCCTTTTTCA ATCAATGTCA ACATAGCCCG TTGGCTAAAG 420 

ATAAGACCAA AAGTCGAGTT CATGTTTCGG ATCATATTTT CTGGGAAGAC TGTCAAGTTC 480 

TTGACGATAT TTCCAAAACG GTTGAGCATG TAGTCAATCA AAATGGTCGT ATCTGGTGTG 540 

ATGATACGCT CAGCTGATGA GTGAGAAATA TCGCGTTCG1 GCCAGAGAGC GACGTTTTCA 600 

TAAGCCGTAA TCATGTGACC ACGAATGACA CGCGCCAGAC CAGTCATATT TTCAGAACCG 660 

ATTGGGTTGC GTTTGTGAGG CATTGCTGAA GACCCTTTTT GCCCTTTAGC AAAGAACTCT 720 

TCTACTTCGC GTTGCTCAGA TTTTTGTAGA CCACGAATCT CAGTCGCCAT ACGTTCGATT 780 

GAAGTCGCAA TGCTGGCAAG AACCGCAAAG TACTCAGCGT GAAGGTCACG AGGAAGGACT 840 

TGTGTTAAAG ATTCCTTGGG CACGGATGCC AAGTTTATCG CAGACATACT CCTCTACAAA 900 

TGGTGGGATA TTGGCAAAGT TCCCAACCGC ACCAGAAATC TTACCAGCTT CTACACCAGC 960 

AGCCGCATGC TCGAAGCGCT CCGATATTGC GTTTCATTTC GCTGTACCAA GTTGCTAATT 1020 

TAAGACCAAA GGTTGTCGGC TCAGCGTGCA CACCATGAGT ACGCCCCATC ATGATGGTGA 1080 

ACTTGTGCTC CTTGGCCTTG TCAGCGATGA TATTAGTGAA GTTTTCAAGG TCACGACGGA 1140 

TGATGTCGTT GGCCTGCTTG TAGAGGTAAC CATAAGCAGT ATCCACCACG TCGGTAGAAG 1200 

TTAACCCATA GTGAACCCAC TTGCGCTCTT CACCAAGAGT CTCAGAAACC GCACGCGTGA 1260 

AAGCCACCAC ATCGTGGCGC GTCTCCTGCT CAATTTCCAA AATACGGTCG ATGTCAAAGT 1320 

CCGCCTTCTT GCGAATCAAA GCCACATCTT CCTTAGGGAT TTCCCCCAAC TCAGCCCATG 1380 

CCTCGTCAGA GAGGATTTCC ACCTCAAGCC AAGCACGGTA TTTATTTTCT TCACTCCAAA 1440 

TATTCGCCAT CTCAGGGCGA GAGTAACGGT TGATCATGTG TTAATTTTTC CTTTCTTCTT 1500 
AAGAT 



(2) INFORMATION FOR SEQ ID NO: 92: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 622 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DMA 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92: 

CGGTGGTGTT GGACTGGCTG CTCCCCAGTT AGATATCTCA AAACGCATTA TCGCTGTTTT 60 
GGTACCTAAT ATTGTTGAAG AAGGCGAAAC TCCACAGGAA GCCTACGATT TGGAAGCCAT 120 
TATGTACAAT CCAAAAATCG TCTCTCACTC TGTTCAAGAT GCTGCTCTTG GCGAAGGAGA 180 
AGGTTGCCTG TCTGTTGACC GTAACGTGCC TGGCTATGTT GTTCGCCATG CCCGCGTTAC 240 
TGTTGACTAC TTTGACAAAG ATGGAGAAAA ACACCGTATC AAACTCAAAG GCTACAACTC 300 
CATTGTTGTT CAGCATGAAA TTGACCACAT TAACGGTATC ATGTTTTACG ATCGCATCAA 
TGAAAAAGAC CCATTTGCAG TTAAAGATGG TTTACTGATT CTTGAATAAA GAAAATCCCG 
TTGCAAGACG GGGTTTTGTG TTATAATAGA GGCATGAAAA CAAATGATAT TGTCTATGGT 
GTCCACGCCG TTACCGAAGC CCTCCTTGCA AATACAGGAA ACAAACTCTA CCTCCAAGAA 540 
GATCTCCGAG GTAAGAATGT TGAGAAAGTT AAGGAACTAG CGGTGACATT TTGAACAAAT 600 
ACCAATTTTG AAATTGTTAT TC 

622 

(2) INFORMATION FOR SEQ ID NO: 93: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1534 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93: 



CTCCACCTCG AAACTTAATC TTAGCCAGCA ATCCCTTAGA AACCTCGTGC TTTTTTAAAA 
AGGTCTTAAC CTTGACATGT TCATCTGCGA TAAATTCAAA CCTCATTCAT CCACCTCACC 

GATAAAGGCA TCCTTAACAC GGTTCCAGAA ACTGGTATGG CTAGGAGTCG CGACAAAGTG „ w 

AATCTTATGA TGGTCGATTT GATACTCAAT ACGCTCAATA TTACGGAAAG AATAAACGCT 240 

ATTGTCAACC GAAATAGTAT GATAATCGTT TCTTGTTGGA ATAAGTTCAA TCTTATCCTT 300 

CTTAGGCACA ATAATGGAAG AGCCCAGTGT TCGATAAACA CGATTATTAA GGCTGGCAAT 360 

TTCCGTTAAT TGCAAAGCTT CAATGGTAGG GTGTAAAACA GCACCGCCAA GAGACTTGTT 420 

ATAGGCAGTA CTACCAGTCG GTGTCGAAAC TGTTAGCCCG TCTCCACGAA AACGTTCAAA 480 

GGGAACACCA TTTATTACAA TATCTGCCAC CATGGTTCGA TCAGACCTGC GGATGCTGGC 540 

TTCGTTGAGT GCTCTGAAAA TCTTAACTTC ACCATTTTCA AGAAAGACCT TCACATTCAG 600 

AACAGGGTAA GAAACCCTTG CCCCAGTATC TAGCTGCAAA TTAGTCACTA GCTTGTCCAA 660 

CTCAAAATCA CGATAATCTG TATAGAAGCC CAAATGTCCA GTATGAAGAC CGATAAAGCG 720 
GACCTTGTCA AGCTGATTTT CGTACTTATG AAAGGCCGAC AAGAGCATAC CATCCCCGCC 
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AATGGAAATG ACAATATCCG GATTGGTATC ATTGAGTTAT AAACTGATTT CTCTTCAAAC 840 

GATCTCGCAA TTCATACAAA ACCCTTTGAC TCTGCGGTTT TCTATTGGCT ATCAGATCAA 900 

TTCGTTTACC TGTATTCTTC ATCTGTATCG TCACTGNNTC CAACACCGTC ATTTAATTTT 960 

CTACTCAAAG GATCAAAAAG TGCCTGGGCT TCTTGGATAT CATCACGAAT TTCACCCATT 1020 

TCTTCATCCA ACTGATGGGC GATTCTAGCT GTAATTTCCA GTCGCTTCTT AATCTCATCT 1080 

GGGAAATCCC CTTGGTACTT GTAGTTGAGA GAATGTTCTA TCGTTGCCCA GAAATTCATG 1140 

GCCAAAGTAC GAATTTGAAT TTCTGCCAAA ATAGTCTTAG CTCCATTGAT GGTATCAACC 1200 

GTATATTCTA CTACCACATG ATAGGAACGA TAGCCTGATG CTTTTCTATG AGTAATGTAA 1260 

TCTCGCTCCT GTATGATTCG CATATCCTGA CGCTTGTGCA AAATATCCAC TACTTCCTTG 1320 

ACGTCATCTA CAAACTGAAC CATCACACGT AAGCCAGCAA TATCCTGCAA ATCGTGTTCC 1380 

AAGGTCGCAT AAGTAATGCC ACGACGAGCC ATTTTTTCTT TGATGCTCTC AATTGGCTTG 1440 

ACTCGACCGG TCACAAACTC AATTGGAGAA TGCTTATTTT GCTTACGATA TTGCTTACGA 1500 

ATACCACGAA GTTTAATCTT TAACTCACCA ACAG , e ,„ 

(2) INFORMATION FOR SEQ ID NO: 94: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 597 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic UNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94: 

CTCGTAAAAA CACTAAAGGA GGTAAATAAC CAATGTTAGT ACCTAAACGT GTTAAACACC 60 

GTCGTGAATT CCGTGGAAAA ATGCGCGGTG AAGCAAAAGG TGGAAAAGAA GTAGCATTCG 120 

GTGAATACGG TCTTCAAGCT ACAACTAGCC ACTGGATCAC TAACCGCCAA ATCGAAGCTG 180 

CTCGTATCGC CATGACTCGT TACATGAAAC GTGGTGGTAA AGTTTGGATT AAAATCTTCC 240 

CACACAAATC ATACACTGCT AAAGCTATCG GTGTGCGTAT GGGATCTGGT AAAGGGGCAC 300 

CTGAAGGTTG GGTAGCACCA GTTAAACGTG GTAAAGTGAT GTTCGAAATC GCTGGTGTAT 360 

CTGAAGAGAT TGCACGTGAA GCGCTTCGAC TTGCTAGCCA CAAATTGCCA GTTAAATGTA 420 

AATTCGTAAA ACGTGAAGCA GAATAAGGAG AAGGCATGAA ACTTAATGAA GTAAAAGAAT 480 

TTGTTAAAGA ACTTCGTGGT CTTTCTCAAG AAGAACTCGC GAAGCGCGAA AACGAATTGA 540 

AAAAAGAATT GTTTGAACTT CGTTTCCAAC CTGCTACTGG TCAATTGGAA CAAACAG 597 

(2) INFORMATION FOR SEQ ID NO: 95: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1023 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95: 



CTGCTTATGA TTTTCAAGTT TGGACTCGAA ATCAATGGCT TCCCCGTGCT TTTAATGATT 
GGCGATTCGT TCTCCTAGTT TCTGATTGGA AGTTTGGGAA TTTTCCCTGA TTTAGGAAAT 
GCGACTATTT TAGTCTTGGT TTCCTTGATT ATGTATACAG TTAGTGGAAT CGCTTATCGC 
TGGTTTTCAA CCATTCTGGC ACTCGTATCT GCCACTTCTG TCTTTGTCTT GACCACTATC 
AGCCTAATCG GTGTTGAGAC CTTTTCAAAA ATTCCAGTAT TTGGCTATGT AGCCAAGCGC 
TTTAGTGCCT TTTTTAATCC TTTTGCCGAT CGTGCTGATG CAGGTCACCA GTTAGCTAAT 
TCTTATTTTG CCATGGTCAA TGGCGGTTGG TTTGGTCTAG GTCTTGGAAA CTCGATTGAA 
AAACGAGGTT ATTTGCCAGA AGCTCATACA GACTTTGTCT TTTCTATCGT GATTGAAGAA 
TTTGGCTTTG TTGGTGCCAG TCTTATTTTA GCTCTCTTGT TTTTCATGAT TTTGCGGATT 
ATCTTGGTCG GTATCCGAGC GGAGAATCCT TTCAATGCCA TGGTTGCACT CGGTGTCGGA 
GGGATGATGT TGGTTCAGGT ATTTGTCAAT ATCGGAGGGA TTTCGGGCTT GATTCCATCT 
ACAGGAGTGA CTTTCCCCTT CTTATCCCAG GGTGGAAATA GTCTTCTAGT CTTATCAGTG 
GCAGTAGCCT TTGTCTTAAA TATTGATGCC AGTGAAAAAC GCGCTAAGTT GTACCGAGAA 
TTGGAAAATC AACCAATGAA CCTTCTGTTG AAGTAGGATA AAGAAAGGAT AGTTTATGTC 
TCTTCAAAAA TTAGAAAATT ATAGTAATAA AAGTGTTGTG AAAGAAGAAG TCTTGATTCT 
AACAGAATTA CTGGAAGATA TTACTAAAAA TATGCTTGCC CCAGAGACCT TTGAAAAAAT 
AATACAGTTG AAAGAATTAT CAACGCAGGA AGATTATCAA GGTCTAAACC GTCTAGTGAC 



(2) INFORMATION FOR SEQ ID NO: 96: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 779 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96: 



C TGGATTG AG AAAAGATAAC AAGCTATTCT CCACTCTCCA ATTCATCCAT ATCTTGTTCA 
AATTTTTTCT GAGCCCATTC GCCATAGCTC TTAAGACCAA GATTGCCAAT AAAGACCCAC 
GGAAGGTAAA TGACATAAGT AATGACCCAA GCAGACAGGT ATTTAAAATT CAAAGGATTG 
TGCTGATAAA TTTCTATGTT GAATTGATAA TTCTGCAACA TCAAAAGAGC CGTAATAGCC 
AAGGTTAGGA AAAAACAACC CAAAATCGTA AAATGAAAAC GACTATAGTA GGTCACTCCC 
AGATAACGGG CACGATTGAA AAAGTAAAAT GTCCCTATGA TGATAACGAT TAGCAGCATA 
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TTAGAATTAA AAAGGCTTGG TGCTAATACT GAAATGATAT AAGATAGGAG CGACAAAGCA 420 

ATGCAGATAT AGAAACTTTC AGAGCCCGCT TTATTGAACA GTTGTTCTTC TCTTTCGTCT 480 

AGTAATTGAT AATAATAAAA TCTATTTTTC ATCTTCTTCC TCCCAAAATA GTTGGTCTAG 540 

GGTTrrCCCT AAACATCTGC AAATAGACTG GCAGAGCGAG AGACTGGGAT TGTATTTTCC 600 

CGCCTCTATC AAACCAATAG TCTGGCGTGT CACCCCGACA GCCTCTGCCA GTTGACCTTG 660 

TGTTAAATCA CGCTCTACCC GAGCTAATTT TAATTTTAAA TTTTTAGCCA CCTTCGTCCT 720 

CCTTATAGTT TTAATACTCA TCTACGCTTA AAAAATCCAA AACCAACACA AGCTATCAG 779 

(2) INFORMATION FOR SEQ ID NO: 97: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 663 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97: 

TCATAAGGAA GCTGTCGCTC GTTCCGCTAA GGTATGGACA CCACGGTGAA CATTGGCATT 60 

GTCCTGCTCA TAGTAACTGT TAATAGCTTT CAGAACTACT AGTGGTTTTT GTGTCGTCGC 120 

AGCATTGTCC AGATAGACCA GAGGTTCATC ATTGACAATC TGATCTAAAA TTGGAAAATC 180 

CTTGCGAATC GCTTCTACAT CTAACATAGG CTTCCCCTTA GCGTTTTGAC AATTTCTCTT 240 

CGATAGTTGC AATCATTTCA TCACGAACTT CCTTGACTGG AATCTCCACG ATAACAGATC 300 

CAAGGAAACC ACGAACAACC AAACGCTCTG CAGTTGCCTT ATCCAATCCA CGACTCATGA 360 

GGTAATACAT GTCTTCTGGA TCAACTTGTC CGATAGACGC TGCGTGTCCT GCAGTGACAT 420 

CATTTTCATC AATCAAAAGA ATTGGGTTAG CATCTGAACG CGCTTGGTCT GAAAGCATGA 480 

GAACACGGCT CTCTTGTTGC GCATCTGCTC CCTTAGCACC CTTGATGATG TGGCCGATAC 540 

CATTGAAAGT CAAAGTTGCT TTTTCAAGGA TAACCCCATG TTGTAGGATA TTTCCGATAG 600 

AGTTGCAGCC ATAGTTAGTT ACACGAGTAT CAATCCCTTG TACCTGACGA CCACTTGAAA 660 



(2) INFORMATION FOR SEQ ID NO: 98: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1426 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98: 

CTCAAGTTTA AAACCAAGCA AGTCAAGGAC AGTCTCTACA AGATTGCTGG AATTGCAGAT 60 

GTAGAAGTTG CTGAAACGCT TGGTATGGAA CATCCAGTCA AGTATCGCAA TAAGGCGCAG 120 

GTGCCCGTTC GTCGAGTGAA TGGTGTCTTG GAAACAGGAT TTTTCCGTAA GAATTCGCAC 180 

AACCTCATGC CCCTTGAAGA TTTCTTTATC CAGGATCCTG TCATTGACCA AGTCGTAGTA 240 

GCTCTTCGAG ACCTGCTCCG TCGTTTTGAT TTAAAACCTT ATGACGAAAA GGAACAGTCT 300 

GGATTGATTC GGAATCTTGT GGTGCGTCGT GGTCACTATT CAGGACAAAT CATGGTCGTT 360 

TTGGTGACAA CTCGTCCAAA AGTTTTTCGT GTTGACCAAT TGATTGAACA AGTTATCAAG 420 

CAGTTCCCAG AGATTGTGTC TGTCATGCAA AATATCAACG ACCAGAATAC CAATGCGATT 480 

TTTGGTAAGG AGTGGCGCAC TCTTTATGGT CAAGACTATA TTACGGACCA GATGTTGGGA 540 

AATGACTTCC AAATCGCTGG CCCAGCCTTT TACCAAGTCA ATACTGAAAT GGCGGAGAAA 600 

CTCTATCAAA CAGCCATTGA CTTTGCAGAG TTAAAAAAAG ATGATGTGGT TATTGATGCT 660 

TATTCTGGTA TTGGAACCAT TGGTTTATCA GTCGCCAAGC ATGTCAAAGA AGTCTACGGT 720 

GTTGAACTGA TTCCAGAAGC GGTTGAGAAT AGTAAAAAAA ATGCTCAGCT GAACAATATT 780 

TCAAACGCCC ACTATGTCTG TGACACAGCT GAAAATGCTA TGAAGAATTG GCTTAAAGAT 840 

GGGATTCAAC CAACCGTTAT CTTGGTTGAT CCTCCACGCC AAGGGCTTGA CAGAAAGCTT 900 

TATCAAAGCA AGCGCCCAAA CAGGAGCCGA TCGCATCACC TATATCTCCT GCAATGTTCG 960 

CAACCATGGC GCCGTTGATA TTAAACTATA CCAAGAGTTG GGATATGAAT TGAAGAAAGT 1020 

CCAGCCGGTG GATCTATTTN TTCAAACGCA TCACGTCGAG ACGGTAGCAC TTTTGTCCAA 1080 

ACTCGATGTC GATAAGCACA TAAGTGTTGA AATTGAGCTG GATGAGATGG ATTTGACAAG 1140 

TGCGGAGAGC AAAGCAACAT ATGCTCAAAT CAAAGAATAT GTTTGGAATA AATTTGAATT 1200 

AAAAGTTTCG ACATTATATA TTGCACAGAT AAAAAAGAAA TGTGGAATAG AATTACGAGA 1260 

ACATTACAAC AAGTNTAAAA AGGATAAACA AATTATTCCA CAGTGTACAC CTGAAAAAGA 1320 

AGAAGCCATC ATGGATGCTT TGAGACANTT CAAAATGATT TAATAGAAAA GAATGACAGT 1380 

ATATGACTTT CTGCATTTAT TACATTCCTA CTTGGTATAG GAACAG 1426 

(2) INFORMATION FOR SEQ ID NO: 99: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 991 base pairs 

(B) TYPE: nucleic acid 

( C ) . STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99: 



60 



CTCTGACGGA GGCTGGTTAT GTGGGTGAGG ATGTGGAAAA TATACTCCTC AAACTCTTGC 
AGGTTGCTGA CTTTAACATC GAACGTGCAG AGCGTGGCAT TATCTATGTG GATGAAATTG 120 
ACAAGATTGC CAAGAAGAGT GAGAATGTGT CTATCACACG TGATGTTTCT GGTGAAGGGG 180 
TGCAACAAGC CCTTCTCAAG ATTATTGAGG GAACTGTTGC TAGCGTACCG CCTCAAGGTG 240 
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GACGCAAACA TCCACAACAA GTAGATGATT CAAGTGGATA CAAAAAATAT CCTCTTCATC 
GTGGGTGGTG CTTTTGATGG TATTGAAGAA ATTGTCAAAC AACGTCTGGG TGAAAAAGTC 
ATCGGATTTG GTCAAAACAA TAAGGCGATT GACAGAGAAA CAGCTCATAC ATGCAAGAAA 
TCATCGCTGA AGACATTCAA AAATTTGGTA TTATCCCTGA GTTGATTGGA CGCTTGCCTG 
TTTTTGCGGC TCTTGAGCAA TTGACCGTTG ATGACTTGGT TCGCATCTTG AAAGAGCCAA 
GAAATGCCTT GGTGAAACAA TACCAAACCT TGCTTTCTTA TGATGATGTT GAGTTGGAAT 
TTGACGACGA AGCCCTTCAA GAGATTGCTA ATAAAGCAAT CGAACGGAAG ACAGGGGCGC 
GTGGACTTCG CTCCATCATC GAAGAAACCA TGCTAGATGT TATGTTTGAG GTGCCGAGTC 
AGGAAAATGT GAAATTGGTT CGCATCACTA AAGAAACTGT CGATGGAACG GATAAACCGA 
TCCTAGAAAC AGCCTAGAGG TGACTATGGA ACTTAATACA CACAATGCTG AAATCTTGCT 
CAGTGCAGCT AATAAGTCCC ACTATCCGCA GGATGAACTG CCAGAGATTC CCCTAGCAGG 900 
GCGTTCAAAT GTTGGTAAAT CCAGCTTTAT CAACACTATG TTGAACCGTA AGAATCTCGC 960 
TCGTACATCA GGAAAACCTG GTAAAACCCA G 



300 
360 
420 
480 
540 
600 
660 
720 
780 
840 



991 



(2) INFORMATION FOR SEQ ID NO: 100; 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1140 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100: 

CTGGCTTGGC TCTGATTGTC AATAAAGGAG GTTGATGTTG ATAACCTAAC GACAGAGCAA 60 

CTTCGTCAAA TCTTCATAGG TGAGGTAACC AATTGGAAAG AGGTTGGTGG TAAGGACTTA 120 

CCCATCTCTG TTATCAATCG GGCAGCCGGC TCTGGCTCTC GTGCTACCTT TGATACTGTC 180 

ATTATGGAAG GTCAGTCTGC CATGCAAAGT CAGGAGCAGG ATTCAAATGG AGCGGTAAAA 240 

TCAATCGTAT CAAAAAGTCC AGGAGCTATC TCTTATTTAT CTCTTACCTA TATAGATGAT 300 

TCGGTCAAAA GCATGAAGTT GAATGGCTAT GACTTAAGTC CAGAAAATAT AAGTAGCAAT 360 

AATTGGCCCT TGTGGTCTTA TGAGCATATG TATACATTGG GGCAGCCCAA TGAGTTGGCT 420 

GCAGAATTTC TCAATTTTGT TCTCTCGGAT GAGACCCAAG AAGGGATTGT CAAAGGATTG 480 

AAGTATATTC CGATTAAGGA AATGAAGGTT GAAAAAGATG CTGCCGGAAC TGTGACAGTG 540 

TTGGAAGGGA GACAATAATG AATCAAGAAG AATTAGCTAA GAAAATGTTG CTTCCATCAA 600 

AGAATTCTCG TCTGGAGAAA TTAGGAAAAG GTTTGACCTT TGCCTGTCTT TCTTTGATAG 660 

TCATCCTTGT GGCCATGATT TTGGTTTTCG TAGCGCAAAA AGGCTTGTCG ACCTTCTTTG 720 

TCAATGGTGT GAATATCTTT GACTTTCTTT TGGGAGGAAC TTGGAATCCT TCTAGTAAAG 780 

AATTTGGTGC CCTTCCTATG ATTTTGGGTT CCTTTATCGT TACCATTCTC TCAGCCCTTA 840 

TCGCAACACC CTTTGCTATT GGTGCAGCAG TTTTTATGAC CGAAGTATCA CCAAAAGGGG 900 

CGAAGATTTT GCAACCAGCT ATTGAACTCC TGGTTGGGAT TCCTTCAGTA GTGTACGGAT 960 

TTATTGGCTT GCAAGTCGTC GTTCCCTTTG TTCGCAGTGT CTTTGGTGGG ACTGGTTTTG 1020 
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GGATTTTGTC AGGGATTTCC GTCCTCTTTG TCATGATTTT GCCGACCGTA ACCTTTATGA 1080 
CAACGGATAG CTTGCGTGCG GTTCCTCCNT TATTATCGTG AAGCCAGTTT CGCTATGGGA n 40 

(2) INFORMATION FOR SEQ ID NO: 101: 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1155 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(Xi> SEQUENCE DESCRIPTION: SEQ ID NO: 101: 

CTGGGACTCT CTTCGTAGAA GTCATGCAAG AATATTTTGA TCAAAAGAGG AAATCATGAA 
AAAAAGAGCA ATAGTGGCAG TCATTGTACT GCTTTTAATT GGGCTGGATC AGTTGGTCAA 
ATCCTATATC GTCCAGCAGA TTCCACTGGG TGAAGTGCGC TCCTGGATTC CCAATTTCGT I 80 
TAGCTTGACC TACCTGCAAA ATCGAGGTGC AGCCTTTTCT ATCTTACAAG ATCAGCAGCT 240 
GTTATTCGCT GTCATTACTC TGGTTGTCGT GATAGGTGCC ATTTGGTATT TACATAAACA 
CATGGAGGAC TCATTCTGGA TGGTCTTGGG TTTGACTCTA ATAATCGCGG GTGGTCTTGG 
AAACTTTATT GACAGGGTCA GTCAGGGCTT TGTTGTGGAT ATGTTCCATC TTGACTTTAT 
CAACTTTGCA ATTTTCAATG TGGCAGATAG CTATCTGACG GTTGGAGTGA TTATTTTATT 
GATTGCAATG CTAAAAGAGG AAATAAATGG AAATTAAAAT TGAAACTGGT GGTCTGCGTT 
TGGATAAGGC TTTGTCAGAT TTGTCAGAAT TATCACGTAG TCTCGCGAAT GAACAAATTA 
AATCAGGCCA GGTCTTGGTC AATGGTCAAG TCAAGAAAGC TAAATACACA GTCCAAGAGG 
GTGATGTCGT CACTTACCAT GTGCCAGAAC CAGAGGTATT AGAGTATGTG GCTGAGGATC 
TTCCGCTAGA AATAGTCTAC CAAGATGAGG ATGTGGCTGT CGTTAACAAA CCTCAGGGAA 
TGGTTGTGCA CCCGAGTGCT GGTCATACCA GTGGAACCCT AGTAAATGCC CTCATGTATC 
ATATTAAGGA CTTGTCGGGT ATCAATGGGG TTCTGCGTCC AGGGATTGTT CACCGTATTG 
ATAAGGATAC GTCAGGTCTT CTCATGATTG CTAAAAACGA TGATGCGCAT CTAGCACTTG 960 
CCCAAGAACT CAAGGATAAA AAGTCTCTCC CGCAAATATT GGGCGATTGT TCCATGGAAA 1020 
TCTGCCTAAT GATCGTGGTG TTAATTQAAA CGCCCGAATT GGCGAATGAA AAAGAACCTT 1080 
AAGAAACCAG GTGGTAACTG GCTAAAGGGA ACCCGGCATG ACCCCTTTTC CCCTTCTTGG 1140 
AACGCCTTGG CGAAT 



60 
120 



300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
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(2) INFORMATION FOR SEQ ID NO:102: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1030 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: Genomic DNA 





SEQUENCE DESCRIPTION: 


SEQ ID NO: 


102 : 








TGTGGCTGGT 


GCTCTTGCCA 


TGACAGCAGG 


TTATTGTGGG 


ACCTTATTGA 


60 




TGCTAATTTT 


AACGCTCTAC 


CAGCAGCATT 


GATGGATATG 


AAAGATCAGA 


120 


ATGGPGTTAT 


AAAGGCTCAA 


GCAGGTGTTG 


CTCTAGTAAT 


GATTGTTATT 


CACATATTCT 


180 


TAATRTAf^TT 
x An x w x x x 


TCTCGCATTT 


TAGTAAAGGA 


AAATAAAATG 


AAAATATTGG 


TTACAGGTTT 


240 


lnnl\.Ll 111 


GGAGGTGAAA 


AGATTAATCC 


AGCTTTGGAG 


GCTGTAAAAT 


TATTACCATC 


300 


lununi X/Wil 


GGGGCTGAAG 


TTCGCTGGGT 


AGAAATTCCA 


ACGGTTTTTT 


ATAAGTCGTC 


360 


AunAUi illn 


GAGGCAGAAA 


TATTACGATA 


TCAACCAGAT 


GCTGTACTTT 


GTATTGGACA 


420 




AGGACCGGCT 


TAACACCTGA 


ACGAGTGGCT 


ATTAATCAAG 


ATGATGCTCG 


480 


1 w x 


AACGAAGGCA 


ATCAACCAAT 


TGATACACCG 


ATTCGTATTG 


ATGGAGCATC 


540 


uuUwinl 111 


AGTAGTTTAC 


CTATCAAAGC 


GATGGTACAA GCTATCAAAA AACAAGGACT 


600 


TCCGGCAGTT 


GTATCCAATA 


GTGCAGGAAC 


CTTTGTTTGC 


AATCATTTGA 


TGTACCAAGC 


660 


TCTCTATTTA 


GTAGATAAAG 


AAATTCCCTA 


ATATGTTAAG 


AGCAGGTTTT 


CATGCATATT 


720 


CCATATATGA 


TGGAACAAGT 


AGTGAACAGA 


AGCCGAATAC 


TCCAGACTAT GAGTTCTATG 


780 


TGGATATTCG 


GCGTAAAGGC 


ATAGAAGCAG 


CAATCGGCGC 


TATGATAGAA 


CATGGAGATC 


840 


AGGAACTCAA 


GTTGGTAGGC 


GGAGAAATTC 


ATTGATAGAA 


AAAAGCTTGA 


GGGGAAAACC 


900 


TTCAAGCTTT 


TGGACGTTTT 


CGAGCCAATA 


CTGCTCGGTA 


AAACATAATT 


TTAGTGCATT 


960 


GGATATAAGG 


TAGGAGTGAA 


AAACTAGCAA 


TGCCAAAGGT AATCCAATTG AGGAAGTACC 


1020 


AAGGAAGAAG 












1030 



(2) INFORMATION FOR SEQ ID NO: 103: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 584 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103: 

CTGTAAAAGA AGTTAATCAA GTTGATAAAG CACTTTTAAA ACAAATTGGT GCAGTTGATG 60 

TCTTAGAAGT GAAGGGTGGC ATTCAAGCAA TCTATGGAGC AAAAGCAATC TTATATAAAA 120 

ATAGTATTAA TGAAATTTTA GGTGTAGATG ATTAAGTACT TACTGACTTA ATAAAAAACA 180 

GAGGAGAGTG ATGGATGAGT AGGATGAAAT GAAATCGCAT ACAAGAAATA AAGAACTCAT 240 

TATCCAAGTT GGATACGCTT ATTACATAGG AGAATACAAA TGAAATTTAG AAAATTAGCT 300 

TGTACAGTAC TTGCGGGTGC TGCGGTTCTT GGTCTTGCTG CTTGTGGCAA TTCTGGCGGA 360 

AGTAAAGATG CTGCCAAATC AGGTGGTGAC GGTGCCAAAA CAGAAATCAC TTGGTGGGCA 420 
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TTCCCAGTAT TTACCCAAGA AAAAACTGGT GACGGTGTTG GAACTTATGA AAAATCAATC 480 
ATCGAAGCGT TTTGAAAAAG CAACCCAGAT ATAAAAGTGA AATTGGAAAC CATCGACTTC 540 
CAGTCAGTCC TGAAAAATCA CAACAGCCAT CGAAGCAGGA ACAG 



584 



(2) INFORMATION FOR SEQ ID NO: 104: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 957 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEONESS : Single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: Genomic DNA 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 104: 

CTGGAATTAA AACGCACGAG TATTGCACGA ATAACCAACC AAACAACCAC TCAGACCATG 60 

TGGATCCATA CCCTTACTTG GCAAAATGGG GCATTAGCCG TGAGCAGTTT AAGCATGATA 120 

TTGAGAACGG CTTGACGATT GAAACAGGCT GGCAGAAGAA TGACACTGGC TACTGGTACG 180 

TACACTCAGA CGGCTCTTAT CCAAAAGACA AGTTTGAGAA AATCAATGGC ACTTGGTACT 240 

ACTTTGACAG TTCAGGCTAT ATGCTTGCAG ACCGCTGGAG GAAGCACACA GACGGCAACT 300 

GGTACTGGTT CGACAACTCC AGGCGAAATG GCTACAGGCT GGAAGTCTCC AACTCTTCTC 360 

TTTTTTAATA CAAGCTATTT TGATTTAACC GGCTGGTCTT GAGCTGTCTG CAAAGCTGTG 420 

GCAATCGTAT CTGCATACAA TTTTGCTCCT GCTTCGATAG TGCTACTCTC ACTCCCGAAA 480 

TGAACCTGGT CTGTTCCAGC CCAAATTTCT GGATGCTCTT TCGCAACTTG ATTCCAATCT 540 

GCTATCGTAA TGTAAGGTGT CTTCTCTGCC AATTCTCTCA TATAGGCAGC AGCCTTCTCA 600 

ACGATGGCAT AGGTCTCTTT TGTCTTATCT CCCTCATAAG GAGTCACCAA AATCATATGG 660 

TGTCCCTTAG GAAGATTTTT CACGATACTG TCCCAGTCAT CCTTGTAATT CTCAGGATTA 720 

TTTACCCCAG TCGCAATGAC CACCGTCTTA GGTAAAAATT TATTCTGGCT ATTATTTAGC 780 

ATGATTTCAT TTGCGGTCTT GGTTGTTACG CTGACCTGCG CGTTAATCTG TGCTCCAGGA 840 

AGAGCTGTCT GTAGTGCTGT ATTTGCCCTT AAAGCCACTG AGTCACCAAT TAACATAGTG 900 

CCATCAGCAA TTCCCAAACT GTTTGCATCT GCCCGTTCTG CCATCACCTT GGTCTGG 957 

(2) INFORMATION FOR SEQ ID NO: 105: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1568 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEONESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

163 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 105: 

CTCGTGCGAC GACAGATGTT ATTCGTCATG GTGCACCAAA GGCAGAGATT GAGGGGCTTT 60 

TCTCAGTTGA GAATAGTCGT CTTTTACAGG AAATTTTTGA TGAGCAAGGT TTAGAATTAG 120 

GTGATGAAAT TATCATCCGT CGGGAAATCT TGCAAAATGG TCGTAGTATC AGCCGTGTAA 180 

ATGGTCAGAT GGTTAATCTG TCTGTTTTGC GAGCTATTGG GCAACATCTT GTAGATATTC 240 
ATGGTCAGCA TGACCATGAG GAGTTAATGC GTCCCCAACT GCATATCCAG ATGTTGGATG 
AATTTGGTGA TGCCGCTTTT TGGGACTTGA AAGAAACCTA TCAAACGAGT TTTGATGCCT 
ATCGGAAAAT GCGCAAGCAG GTTCTGGAAG TCAAGAAAAA CCAACAGGAA CACAAGGCAC 
GTATCGAAAT GTTGGAATTT CAAATGGCAG AGATTGAGGC AGCAAACTTG CAGGCTGGAG 
AAGACTTGGC TCTCAATCAA GAGCGAGATA AACTCCTCAA CCACAAAAAT ATTGCGGATA 
CACTGACCAA TGCCTACAGT ATGTTGGACA ATGAAGATTT TTCAAGTCTG GCCAATGTTC 

GTTCAGCTAT GAATGACATG GAAAGTGTCG AAGAGTATGA TCCTGAATAC CGTGAAATTT DOU 

CAAGCTCTCT GTCTGAGACC TACTATGTTT TAGAAGACAT TAACAAACGT TTGGAAGCTA 720 

TCATTGAGGA CCTTGATTTT GATGGCAATC GCCTGATGCA GGTTGAGAAT CGTTTGGACC 780 

TCCTTCATAC TATTACTCGT AAGTATGGTG GGACTGTTGA TGATGTTTTG CTTTATTTTG 840 

CCAAGATTAC GGAAGAATAC AATCTCTTGA CAGGCAATAA TCTTTCCGTC TGAGGACATG 900 

GAAGCAGAGC TTAAGAAGTT GGAAGTCAAT CTTGTCAATT TGGCAGGTCA ACTTGCTTCT 960 

GCTCGTCATA ATTTGGCTCA GCAACTCGAA GCTGAGATTA AACAAGAACT GCAACATCTT 1020 

TATATGGAAA AAGCCCAGTT TCAGGTTCGT TTTAGTAAGG GAAAATTCAG TCGTGAGGGA 1080 

AATAAAATGG TTGAGTTTTA TATTTCAACC AACCCTGGAG AAGACTTTAA ACCCTTGGTT 1140 

AAGGTTGCTT CTGGAGGGGA ATTATCTCGT CTCATGTTAG CCATTAAGTC TGCCTTTTCA 1200 

CGTAAAGAAG GCAAGACTAG CATTGTCTTT GATGAGGTGG AAACGGGAGT TTCAGGTCGT 1260 

GTTGCTCAAG CTATTGCTCA GAAAATTCAT AAAATTGGTC AGCATGGTCA GGTTTTGGCT 1320 

ATCTCCCATT TGCCACAAGT AATTGCGATT GCAGATTATC AATTCTTTAT TGAGAAGATT 1380 

AGTAATGACC ATTCAACGGT TTCGACTGTT CGTCTCTTGA CGGTCGAAGA GCGAGTGGAG 1440 

GAAGTTGCCA AGATGTTGGC AGGTGATGAT GTGACAGAAG CAGCCCTGAC GCAAGCCAGA 1500 

GAATTGTTGA GAAACAGGGA GAAATAAGAT GACAGATTAT TATGTAATTG GAGATGTTCT 1560 
CGGAAAAG 



(2) INFORMATION FOR SEQ ID NO: 106: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 653 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 106: 



300 
360 
420 
480 
540 
600 
660 



1568 



AGGAGACCAG CCAGAAATTG TGGAAGTGTC ACCAGATGCC CTTATTCCTT 

164 
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ACGCTGCAGT GGTACCGCAT CAGGCAGAGG TCATTTTTCG CCAGCGCTTT GAGATGACAG 120 

ACAAGGGAAT TGAAAGAACG AATTGAGTCT TATTATGATG GGGGAAAATC TTTTCCGCTC 180 

TATCAAGATT CGGGGGGAAT TTTTACATAT GCATGTGCGT ATGATTCCTA AGTCAACACC 240 

CGATACCAAG TTTGCTGATG TCGCAACCCA TCAACCGGAA TATAGTCGTG ACAATGTTGC 300 

GGGGACTATT GTTGGTTTCT GGACGCCTGA GATTTTCCAT GGGGTCAGTG TGGCAGGCTA 360 

CCATCTGCAC TTCATATCAG ATGATTTGAC CTTCGGTGGA CATGTCATGG ATTTTGTCAT 420 

CAAGGAAGGC ATTATCGAGG TGGGAGCAGT TGACCAGTTG GACCAACGTT TCCCAGTCCA 480 

AGACCGTCAA TACTTGTTTG CTAAGTTCAA TGTTGACGAG ATGAAAAAAG ATATTGAAAA 540 

GGCAGAATAG GAGAAGAAAA TGACCATTCA TATCATTATT ACCATGTTGC TGTTGCTAGC 600 

TTTTCTGATA GGAAGCATTT GGTTTGCCAA AAAGAAATAT CAGATTAATC TAG 653 

(2) INFORMATION FOR SEQ ID NO: 107: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 708 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO:! 


107: 






CTATTATTGC 


AAGGTNNTGA 


AGTCGAACGA 


AAAAATTACT 


CTAGTCCTTT 


TGATTTGGAA 


60 


TGTCATCATT 


TCCTTGATTT 


ATGGAATTGA 


CATATTCAAG 


GCAAGAAAAA 


GAGCTTGGCG 


120 


CATCCCAGAG 


AAAATCTTAC 


TCATATTACC 


CCTTACTTGT 


GGTGGTTTGG 


GGGCCTGGTT 


180 


TAGCTGGAAT 


CACTTTTCAC 


CACAAGACTC 


GAAAATGGTA 


CTTTAAAACA 


GTTTGGTTTC 


240 


TTGGGATGGT 


GACCACACTA 


GTAGCCTTAT 


ATTATATTTG 


GAGGTAATGG 


ATGGCAGGGT 


300 


CTTCGAGGGA 


CCACGCTGCT 


TGGGCTCTAG 


CGGACTATGG 


TTTTAAGGTC 


GTGATTGCAG 


360 


GATCTTTCGG 


TGACATTCAT 


TACAATAATG 


AACTCAATAA 


TGGCATGTTG 


CCAATCGTTC 


420 


AGCCTAGAGA 


GGTTAGAGAG 


AAACTAGCCC 


AGTTAAAACC 


AACCGACCAG 


GTAACTGTGG 


480 


ACTTGGAACA 


ACAAAAAATC 


ATCTCACCAG 


TTGAAGAATT 


CACCTTCGAG 


ATAGATAGCA 


540 


AGTGGAAACA 


TAAACTCCTA 


AATAGTTTGG 


ATGATATCGG 


TATTACCTTG 


CAGTATGAAG 


600 


AGTTGATTGC 


TGCTTATGAA 


AAACAACGAC 


CAGCCTACTG 


GCAGGATTAG 


AAAAAATAGA 


660 


AAAGGAAATA 


TAGTAAACTG 


AAATAAGATG 


TAAACAAATG 


AATTGGAG 




708 



(2) INFORMATION FOR SEQ ID NO: 108: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1381 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 108: 



CTCGTGTTGT 
GCAATGAATT 
TCAAGACCAT 
GACTGCCCGT 
TTTGCCGTAA 
CTATGATCAA 
CCCTGATGTC 
TCGTGACACC 
CGAAGAGATG 
ACTCAAAGAT 
TCCAAAGGAT 
AGACGAAGCG 
CCAAGGTTGG 
AGTGGTTGCT 
CCACCGTGTG 
GACAGTTCTT 
TCGTGTCGAC 
TGCCGTTCGT 
CCAGCAGAAG 
TGCTCAGATT 
CCGTTCAGAA 
CGGCTTGACC 
TGCCTTGGTG 
G 



GGACGAGTTA 
GTTCGAAGGT 
CTGTCAGTAT 
TTACGATGGA 
ACATTATTTT 
CTACAAGCTG 
GTTTCAGACA 
GTAATAGCCT 
ATTAAGGAAT 
GCCAAGGCTG 
CCAAACGATG 
GCACTTTTCG 
CGCTTTGAAG 
ATGGTTTCAG 
CAACGTGTTC 
GTTATGCCAG 
ATCTATCACG 
ATCGTTCACT 
AACCGCGAGA 
GCTCAGGATG 
CGGATCCGAA 
CTCCAAAAAC 
CTTTATGACC 



GACATACCCG 
TCCAAATATC 
TGCAAGAAAA 
GAACAGATCC 
GCCCCTGAAA 
TAGAAGACCG 
CCAAGCGTTT 
ACCGTGAGTA 
CAGGCGGAGA 
AAAAAGAAGA 
ACAAGAATAT 
CTGGAGATTT 
TCATGGAAGC 
GTCAGTCTGT 
CTGTGACAGA 
AAGTTGAAGA 
CCTCTGGTGC 
TGCCAACCAA 
AGGCTATGAA 
AACAAGACGC 
CTTATAACTT 
TAGATACGAT 
AAACACAAAA 



TCATGGCTTT 
TCTTGCTTTT 
AGGCGACCAT 
AGATTGGTGG 
TCAATAAGGA 
TTATGAAGAA 
TATGGAGCTT 
TAAACAAGTC 
TGCGGACTTG 
ATATGAAGAA 
CATCCTTGAA 
GCTAACTATG 
CTCTATGAAT 
ATACTCTAAG 
AAGCCAAGGC 
GGTTGAATAC 
TGGTGGACAG 
TATCAAGGTT 
GATTATCCGT 
TGAGCGTAAG 
CCCACAAAAC 
TTTGTCTGGT 
ACTAGAAGAA 



TGGCTTGAAA 
AGCGGACAAG 
GGTGTTGCGA 
TAATGAAACC 
GAATGAAGAA 
CTAGGAGAAT 
TCAAAAGAAG 
CTTCAAAATA 
GAAGAATTGG 
AAACTGAAAA 
ATCCGTGGAG 
TACCAAAAGT 
GGTGTCGGTG 
CTTAAGTATG 
CGTGTTCATA 
GACATTGATC 
AACGTCAATA 
GAGATGCAGG 
GCACGCGTCG 
TCGACAATCG 
CGTGTCACAG 
AAATTGGACG 
TTAAACAAAT 



AATGACTTTC 
ATTGACGAAA 
ACACAGGATG 
TATATCTCGG 
AAATGAACAT 
TGCTGAGTGA 
AAGCTTCAAA 
TCGTCGATGC 
CCAAGCAAGA 
TTTTGCTCCT 
CAGCTGGTGG 
ATGCGGAAGC 
GTTTTAAAGA 
AATCAGGTGC 
CTTCGACAGC 
CAAAAGACCT 
AGGTTGCGAC 
AAGAACGTAC 
CTGACCACTT 
GTACTGGTGA 
ACCACCGTAT 
AAGTTGTGGA 
AATGAAATTA 



(2) INFORMATION FOR SEQ ID NO: 109; 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 1276 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1381 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 109: 



CTCCCATTTG CGATACTGGG ATACCTGTCA TTCGTTCCAC AGATTCAGCC ACATCGTTGA 

166 



60 



WO 97/43303 



PCT/US97/07950 



CACTTGCAGT CACTTTCATA TCTTCTGTGT GGTTTTCGAT TTTCCTTTCC AATTCTGCAA 120 

TGCGTGTTTT ATAGTTTAGA GCTGCTTCAA AATCTTCTGC TTCAACTGCT TTTTCTTGCT 180 

TGTCTTTTTC CGTTTCGATT TCTCGTTCAA CAGCATGCAC ATCTGTTACT GGATGTTGAG 240 

CCGCCAAGTG AGCAGCCGTT ACATCGACAA GGTCAATAGC CTTATCTGGC AAGCTACGTT 300 

GAGGAATGTA TTGAACAGAA TAATCCACCG CTGCTTTCAA GACTTCGTCT GGCAAGATGA 360 

CATTGTGGTG TTGTTGATAG AGGTCACGAA TGCCTTGAAG AATTTTAAAA GTATTCTCTG 420 

CTGAAGGAGC ATTGACCTTC ACTTCGTTGA AACGACGAGC AAGAGCAGCA TTCTTCAAGA 480 

AGGTGTTACG GTATTCGTCT TGAGTTGTTG CCCCAATCAC TGTCAATTCT CCACGAGAGA 540 

GACTGGCTGA GAATATCCGC AAGCCCTTTA GAACCACTGT CTCCACAAGT GCTACCAGCA 600 

CCAAGAATTG TGAATTTCAT CAAAGAAGAG GATAATATTC CCTGCTTC TT TCACTTCATT 660 

GACTAAGTTT TGGACATTTT CTTCAAAGCT ACCACGGTAT TGAGTACCAG CCTCAAGACC 720 

TGAGATATCA ATAGAAATAA TTTCCTTGTT CTTGATAGCA GCAGGAACAT CTCCGTTCAC 780 

AATGGCTTGC GCTAGACCTT CGACAACTGC TGTCTTACCA ACACCTGCAT CTCCGACCAA 840 

AACAGGATTG TTCTTGGTGC GGCGTGAGAG GATTTCAGAT GTTTCTTGAA TTTCCTTGTT 900 

TCGTCCGATA ACAGGATCCA ACTTGCCCTC ACGCGCTTCT GCTGTCAAGT TTCGACCTAG 960 

TTTTGCAAGG ACACCGTCTT GTTTCATACC TGATGCCTGT TGTGGCATTT GCACATCAGT 1020 

TTCTGCATTT CCTGGTAATT GACCAGTCGT ACGATAGTGA GCAAATTCCT CAGGTCTGAC 1080 

TTCGCGTCCA TTAATTAAGT AACGGCGATT TTCAGAACTG TATCCTCGCA TACCACCCAT 1140 

CAATTGGTTA AATAAATCAT CCATGTTATT AAAATTATTA AAGTTGTTAT TCATATTCTT 1200 

TACCTCTTTT TGTTTACTTA GTTATGATTA CTGATATTGA CTATCTTTGA CCTTTGTTTT 1260 
AAAAAATTTA GACTAG 



(2) INFORMATION FOR SEQ ID NO: 110: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1234 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



1276 



(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE- DESCRIPTION: SEQ ID NO: 110: 

CTTGGCAGAA TTTGCCGGTC TATCAGACGC ACTTAAAGGG CAAGGACTCT AAGTATGCAG 60 

GGACAAATCA TTAAAGCCTT GGCAGGTTTC TACTATGTGG AGAGTGATGG CCAGGTTTAT 120 

CAAACACGCG CGCGTGGGAA TTTCCGTAAA AAAGGCCATA CCCCTTATGT TGGGGACTGG 180 

GTACATTTCT CTGCCGAGGA AAATTCAGAA GGCTATATCC TAAAAATTCA CGAACGGAAA 240 

AACAGTCTAG TTCGTCOGCC TATTGTCAAT ATCGATCAAC CTGTAGTAAT CATGTCCGTC 300 

AAGGAACCTG ATTTTAACAG CAATTTGCTG GATCGTTTCT TGGTTCTTTT GGAGCACAAG 360 

GGCATCCATC CCATTGTCTA TATTTCCAAA ATGGATTTGT TGGAAGATAG GGGAGAACTG 420 

GATTTTTACC GGCAGACCTA TGGTGACATC GGCTATGACT TTGTGACCAG TAAAGAGGAA 480 

CTCCTGTCTT TGTTAACAGG CAAGGTTACG GTCTTTATGG GGCAGACAGG TGTTGGGAAG 540 
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(2) INFORMATION FOR SEQ ID NO: 111: 



600 



TCAACTCTTC TCAATAAACT CGTACCAGAC CTCAATCTTG AAACGGGAGA AATTTCAGAC 

AGTCTAGGTC GCGGTCGCCA TACCACTCGA GCTGTTAGTT TTTACAATCT CAACGGGOGT 660 

AAAATCGCAG ATACACCAGG CTTTTCATCT CTGGACTATG AAGTATCAAG GGCTGAAGAC 720 

CTCAATCAGG CTTTCCCAGA GATTGCTACT GTTAGCCGAG ATTGTAAGTT CCGTACTTGT 780 

ACCCATACCC ATGAGCCGTC TTGTGCCGTC AAACCAGCTG TTGAAGAGGG TGTTATTGCA 840 

ACCTTCCGTT TTGATAATTA CCTGCAATTC CTTAGTGAAA TTGAAAATCG TAGAGAAACC 900 

TATAAAAAAG TCAGCAAAAA AATTCCAAAA TAAGGAGAAA CCTATGTCTC AATACAAGAT 960 

TGCTCCGTCA ATTCTGGCAG CAGATTATGC CAACTTTGAA CGTGAAATCA AACGTCTAGA 1020 

A GCAACT GGG GCAGAATATG CCCATATCGA TATCATGGAC AGTCATTTTG TACCGCAAAT 1080 

CAGTTTTGGT GCAGGTGTGG TCGAGAGCCT TCGTCCTCAT AGTAAGATGG TTTTCGATTG 1140 

CCACTTGATG GTGTCAAACC CTGAACATCA TCTGGAAAAA TTTTGCGCCC GTTGCAGGGT 1200 
TGCAAAAAAT TCATCCAGTT ATCCCATGTT TAAA 



1234 



(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 945 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 111: 



60 



CTCTTAAGAC CAATTCCGAG TTACTTCTTC ATCAGCCTTT AACCGATCCA CTAATTGGTC 

AACTGAGTCA AATTTGGTCA TATCTCGAAT GCGATCAAGC CAATAAACCA TGACGGTTTC 120 

CCCATAAATA TCTTGATTAA AATCAAAAAT ATTGACTTCA AAACGTGCTT CTTCTCCATC 180 

AAAGGTCACA TTTTTCCCGA CACTAGCCAT AGCACGATAC TTCTGTCTTT GAATCTCAAC 240 

ATCAACGACA TAAACGCCAT CTGCTGGCAT ATAAGTACGG TCTAAAAGCA CTAAATTCGC 300 

TGTCGGATAA CCAATTGTAC GACCACGAGC ATTACCATGA ACTACCATAC CTCTTGATGG 360 

AAGCGGTGCC CCCAAAAGTT TTCCTGCTTC TTTCACATTT CCATCTAAAA TAGCTTGACG 420 

GATACGAGTT GAACTAATCT TTCCTTTCTC ATCTTCTACA GGTGGAACAA TGATAACTTC 480 

TCCATCAAAG TAATCCTTTA AATCTTCTGC TGTTTTTTTG TCAGAACCAA ATGTATAATC 540 

AAAACCTGCA ACAATAATTT TGGCATTCAT AGCCTTGATA TAAGTTGCAA AGAATTCTTG 600 

TGCAGTGAGA CTAGCGAATT GACTACTAAA ATCAAGGAGA TATAATTCTT CTACACCTTC 660 

GCGCTTTAAT TTTCTTTCAC GTTCAGCAGG GTTCAAAATA TGCAAAAACA AATCTGGATG 720 

ATAAGGCTCT AAAGCGATCT TTGGAGATTC ATTAAAGGTC ATAACGACGA TAGGCAATAA 780 

ATCCTTTCTC GCAGCCTTGT TGGCAACACG AAATAATTCT TGATGCCCCT TATGTATGCC 840 
ATCAAAATAG CCGAGAACAA CGACTGAATC AGATGGTGTG CCAATATCTT TTTGGTTTTT 
TATAGGAATA GTAATAATCA TAAAATAATT ATATCATAGC GATAG 



900 
945 



(2) INFORMATION FOR SEQ ID NO: 112: 
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(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 653 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 112: 

CTATTTTAAA AGGAGAATTG GCAGATGGCG ACTACTGAGT TGATTGAACT GGCAATTGAA 60 

ACCAGCAAAC ATGCCTATGT CCCCTATTCT CACTTTCCTA TCGGAGCAGT TTTAGTAGCC 120 

AAAGACGGAA ATGTTTACAC GGGAGTGAAT ATCGAAAATG CTAGCTATCC TTTGACCAAT 180 

TGCGGTGAGA GAACAGCTAT TTTTAAGGCT ATATCTGAAG GCCAAAGAGA ATTTTCAGAA 240 

TTGATTGTCT ATGGTCAAAC TGAAAAACCG ATTTCACCAT GTGGTGCTTG TCGCCAAGTA 300 

ATGGTCGAAT TTTTTGAACA AGATTTAAAA GTGACCTTAG TCGCAAAAGA TAAATCGACG 360 

GTCGAGATGA CGGTCGGGGA GTTACTTCCA TATTCTTTTA CAGACTTAAA TTAGTCTGAG 420 

TCGCTCTTTG AGTGGCAAGG GTCTTTGTGA CCAATCAATC CATACTTGCA ACATCGTTGC 480 

ACATCTTATT TAGGAGGTTC AGTAATGAAC AAGAAACAAT GGCTAGGTCT TGGCCTAGTT 540 

GCAGTGGCAG CAGTTGGACT TGCTGCATGT GGTAACCGCT CTTCTCGTAA CGCAGCTTCA 600 

TCTTCTGATG TGAAGACAAA AGCAGCAATC GTCACTGATA CTGGTGGTGT TGATGACAAA 660 

TCATTCAACC AATCAGCTTG GGAAGGTTTG CAGGCTTGGG GTAAAGAACA CAATCTTTCA 720 

AAAGATAACG GTTTCACTTA CTTCCAATCA ACAAGTGAAG CTGACTACGC TAACAACTTG 780 

CAACAAGCGG CTGGAAGTTA CAACCTAATC TTTGGTGTTG GTTTTGCCCT TCACAATGCA 840 

GTTGAAGAAG CAG 853 

(2) INFORMATION FOR SEQ ID NO: 113: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1007 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 113: 

CTCAGCTACT TCGCCTTTCT TTTTATTCTA CTGGTTTTTC TTGATTTCCA GTAGTTGTAG 60 

AAGATTCTGT TGTTTTATTT TCTGAAGTTG ATTCAGCAGG TTTAGAATCT CTTGTATTGC 120 

TTGGTTTGTT TTCGTCGCTA GCAGTTTCAA TGTTAGATTC TGCAGTTGCG TTTGGTTGGT 180 

TCTCAGCACT GGTGTTATCA CCATTTGCTT CAGCATTTCT TGCTGGACTT GTTTCTTCAC 240 
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TTGCGCTAGC TTTTGACTGG ATTTGATGAT TCAAAACTAG AATAGCTTTT GTCGATTCAA 300 

GTAAAGCTGT TTTGTCTTTA CTATTAGCAG AAAGTTGATC TAATAATGCA TCCACCTTAT 360 

CAAAAGTCCG CATCAGATCC ATTATTACTT TCTAAATAAA AGTCAAGCGA CATGAGAATA 420 

TCGTAGAGTT TTTGATAGAG TACAAGTGTC TGAGGATCTT GCTCAGCATT TTCCTTTTCT 480 

TGTTGAAGGG CGCTAGCGAT ACGAGTCAAG ACATCTTTTA CCTGACTGTT TACTTCATCC 540 

AAGTCTGCAT CAGCCTTGTT TGTGGCAGCT TTTAGATTTT CTACTTCTTC TGCCAAAGAT 600 

TGTCTGATTC CTTCTTCATG GATTCGTTCC AAGAGTTGAT TTGCCTTGCT CAAAAGACTT 660 

TCTACTTCTT CCTTGCTATC TGTCGCAGAT TATTGGTTGC TATCTACCAT GTACTCCTAA 720 

AACAGGAGAG TTATAATCCA AGATTACAAG GCCTTACAGA AATAAGAAAT CCAGATAAGA 780 

CAATGTTCGT CCAAGACGCT ATTCGCTTCG CACAGCAGCA CGGATTCAAT ATGCTTTAAT 840 

TTTAAAGTTT AGGTGTCAAG ACCTCTTTTT AGTGTGCCCA AAATTTAGAG AAGTAATCAA 900 

TCAACTAACT TTTATTTTTT TCAAACTTTC AGTAAACTGA CCTAAAGCTA ACTCAATCTG 960 

TCTTTGTTCG ATAGGCTTGT CTTTGTAGAT GCTTCTGCTA TCAGATC 100 7 

(2) INFORMATION FOR SEQ ID NO: 114: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 896 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: Genomic DNA 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 114: 

CTCATTCAGA GCTTGAAAAT GCAGCCTGTG TCGTGATTGG GACAGGGATT GGCGGAGCCA 

TGATTATCAA TGGTAGACTT CATCGAGGTC GCCACGGCTT GGGTGGAGAA TTTGGCTACA 120 

TGACAACCCT TGCCCCTGCT GAAAAACTTA ATAACTGGTC GCAACTAGCG TCAAACTGGG 180 

AATATGGTAC GATACGTGAT TGAAAAATCT GGTCATACTG ATTGGGACGG TCGCAAGATT 240 

TACCAAGAGG CCGCAGCTGG TAATGCTCTT TGTCAAGAAG CCATTGAGCG CATGAACCGC 300 

AATCTGGCGC AAGGCTTGCT CAATATCCAG TATCTCATCG ATCCAGATGT CATCAGTCTG 360 

GGTGGCTCTA TCAGTCAAAA TCCAGATTTT ATCCAAGATG TCAAGAAGGC TGTTGATAAC 420 

TTTGTCGATA CCTACGAAGA ATACACGGTC GCACCAGTTA TCCAGGCCTG CACCTATCAC 480 

GCAGATGCCA ATCTCTACGG TGCTCTTGTC AACTGGCTAC AGGAGGAAAA GCAATGGTAA 540 

GATTTACAGG ACTTAGTCCC AAACAAACGC AAGCTATTGA GGTTTTAAAA GGTCACATTT 600 

CTCTACCAGA TGTGGAAGTG GCTGTCACTC AGTCTGACCA AGCCTCTATC TCTATCGAGG 660 

GTGAGGAAGG TCACTATCAA TTGACCTACC GCAAACCTCA CCAACTCTAT CGTGCCTTGT 720 

CCTTGTTGGT AACAGTTCTA GCAGAAGCTG ATAAAATAGA GATCGAAGAA CAAGGCAGCT 780 

TACGAAAATT GGCTTACATG GTTGACTGTT CTCCAAATGC GGTGCTGAAT GTGGCTTTCC 840 

TGCCNAAGCA GATGACTGAA GGTCTTGGCT CTCNTGGGCT ACTCCACCCT TTGAAG 896 



(2) INFORMATION FOR SEQ ID NO: 115: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 594 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 115: 

CTAGCAACAA GNGGNGCTAA AATCAAGGTT ACCTTATCGC CGACACCACC AGTAGAATGC 
TTGTCAACTT TCACACCATC AATGGCTGAC AGGTCAAACT CTTGCCCAGT CTTAACCATA 
TTCATCGTTA AATCAGAGAT TTCTCGAGTC GTCATTCCTT TAAAATAAAC AGCCATAGCA 



60 
120 
180 

AAGGCAGACA TCTGATAATC AGGAACAGTT CCTGATACAT AGCCTTCTAT CAGCCATTCA 240 
ATTTCACTTG AAGTCAGTTC TTGACCGTCT CGTTTTTTTT GGATTAAATC AACTGCTCTC 300 
ATTCTTTCAC ACTTCTAAGG ATATAGTATC CTTTGTCTTT TTTAAGGATT TCACAATTAC 360 
CAAACACATC CTCCATCTTG GACTTAGCAC TTGGAGCCCC TTGTTTTTTC TGGATGACGA 420 
TTGTCAAATC CCCACCAGTT TCCAAGAAAT CTTTACTTTT CTCAATGATT TCATGAACGA 480 
CTTGCTTGCC CGCACGGATA GGAGGATTGG AAATGACATG ATCAAAATGT CCTTCAACTT 540 
GTTCATAGAT GTTGGACTGG AAATCTCCTT GCATTATTTT TCACATTCTC CAAG 594 

(2) INFORMATION FOR SEQ ID NO:116: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 643 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(Xi) SEQUENCE- DESCRIPTION: SEQ ID NO: 116: 



60 
120 
180 



CTAGTGTTTT ATAGCCTGTT TCTTTCACAA ATTGAGCAAA TTCTTGATTC GTAACTGGAG 
TATCCGCAAT GGAAAAATCC TTTACACTAA CAGCGACTTG AGGACCTTCA AAATCATCTA 
AAAAGCCCTC TTCAGAATTT GTTCCCATCA TAAATGTCCC ACCCTTGATT TTTTTCATTT 
TCATGATTGA CTATCCCCCG TTTTCGTTAG AGTTGGCGTT ATTTTAGAAA GTTCAACAGG 240 
AACTAACTCT TCATCCACAA CAAAACCTTC TTCTCTATAT CTAAGAAAGT CTACTAATTT 300 
TTTCTTCATC TGACGAACAA TAGGTTGATA TTTTTCACTT GGATACAAAT CATTCATTTC 360 
ATGCGGATCT TTTTTCATAT CAAATAACTG ATAATGATTT AAAACAGGAA ACCAGATAAA 420 
TTTCCACTGG TCTGTCAAAA TATATTGACT AGAATCTTTT CCTAAAGCAT GTTCACCATG 480 
AAATTCTGTT CGCCACCCCT CGTACTGTCC AAATAATAAA ' TTCCTTAACA CTCCGTCCAT 
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CTAGTTCATC TGTAGTAGTT CCTCCCTGCC AAATCCACGA AGTGAAGGAA AAATATCTTG 600 
AATTTTCACT AATTGCTTGA TTGTTCCCTA TTACCAGATC GAN 6 43 

(2) INFORMATION FOR SEQ ID NO: 117 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1077 base pairs 

(B) TYPE: nucleic acid 

(C) STRANOEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic UNA 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 117: 

CTGTTCTCAA TCATATTGTC AGTCGCTTTG ATATCATTGG TATCCTTGGT AACGGGGCCG 60 

ATTTTGCCAT TCTTGAGCAA GCCAGCGTCC CAAGATTGTG ATATCTTTAT CGCCCTGACT 120 

GAGCACGATG AAGTCAACAT GATTGCAGCA GTTCTAGCCA AGAAAATGGG AGCTAAAGAA 180 

ACTATCGTTC GGGTGCGGAA CCCTGAATAT TCTAACTCTT ATTTCAAGGA AAAGAATATT 240 

CTCGGTTTTT CTCTTATCGT TAATCCTGAG CTCTTGGCTG CCCGCGCTAT CGCGAATATC 300 

ATTGACTTCC CCAACGCCCT GTCTGTCGAA CGCTTTGCTG GTGGACGCGT TAGCCTCATG 360 

GAATTTGTCG TCAAGTCCAC CAGCGGTCTT TGCCAAATGC CCATTTCTGA TTTTCGTAAA 420 

AAATTTGGTA ATGTCATTGT CTGTGCGATA GAGAGGGATC ATCAAATTAT CATTCCAAGC 480 

GGTGACATGA CTGTACAGGA TAAAGATAGA ATCTTTGTCA CTGGTAACCG TGTCGATATG 540 

ATACTCTTCC ATAATTATTT TAAATCACGC GCCGTGAAGA GCCTTCTCAT CGTTGGAGCA 600 

GGTAGAATTA CCTATTATCT ACTTGGTATT CTCAAAGATA GTCGTATCGA TACAAAAGTC 660 

ATTGAAATCA ATCCTGAAAT CGCCAGCTTC TTTAGCGAGA AATTCCCAAA TCTCTACATC 720 

GTTCAAGGAG ATGGAACCGC AAAAGATATC CTGCTGGAAG AAAGTGCTCA ACACTATGAT 780 

GCCGTTGCGA CTCTAACAGG AGTCGATGAG GAAAATCTGA TTACATCTAT GTTCCTTGAC 840 

AGGGTAGGTG TACAGAAAAA TATTACTAAG GTCAATCGTA CCAGTCTCCT CGAGATTATC 900 

AATGCGCCTG ATTTTTCAAG TATCATCACA CCTAAAAGCA TCGCTGTAGA TACGATTATG 960 

CACTTTATTC GTGGTCGAGT TAATGCCCAG TATTCAAGAC CTTCAAGCCA TGCACCATCT 1020 

AGCCAATGGC CAAATCGAAA CCCTGCAATT CCATATCAAG GGAAGCCAAT AAAATGA 1077 

(2) INFORMATION FOR SEQ ID NO: 118: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1499 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:118: 

CTTGAATGGC AGAGCATTCA GCGTAGTCTC GCCCCCAACT ATAAGGAGAG AAGTCAGGGT 60 

GAAAGTTTCC TTGATTGTCT CGTGCTCGCA TGTAACCTGT CTCAGCGTCA AATAGCTGGC 120 

GGTAATTTTG TGAAGCAGCC TTGTAGGTTT CAGCGATTTC TATGTTCTCT AGTTTTTTGG 180 

CACAGCTGGC GATACAAAAG TCACTATAGG CATAGTCTAG AGTATGGCTA ACACTTTCGT 240 

GGTGGTCGGT AGAGAGGTAA CCTAGTTCTT GGTATTGGGC TAGTCCGTGG CGGCCATTGA 300 

TTCCGAGAGG GTCGGCTTTG CTGGCTGTTT CGGAGCATGG CTTGGAAGAG TTCTCCTTCT 360 

AGGTCAGGGG CCATGTCCTT GCAGGCGCTA TCTGCGATAA TACCGTCTAA AAGTGTACCT 420 

GGCATCATAC CCCGTTCATC TGGAGCCAGC CATTTTGGAA GGAAACCAGT ATCGCGGTAG 480 

CTATTGAGGA AACCTTCTAA AAAGCGTTGA TAGTGCTCCG GTATGATAAG GGCAAAGAGG 540 

GGGAAGGTGG TGCGGAAGGT ATCCCAGAAA CCATTGTTGC TAAAGAGGAC ACCAGGCTTG 600 

ACAGTACCAG TAGCCAGATC CATGTGGATG GCTTGCCCTG ATTCATTAAT CTCATAAAAA 660 

GTCTGTGGGA AGAGGAAGAG TCTGTAGAGG CAGTGGTCAA AGAAGGTTCG GTCAGCCTCT 720 

CCTATCTCTA TAATGTCAAA ACGATGGAGG AGATTTTCCC AATCCGCTTG GGCACTTGAT 780 

TTACAGCTAT CAAAATCTTC TTGAGGTAGA TTAAGCAGTG CTTGAGAAGG AGAGATGAAA 840 

GAAGTGGCTA GTTGCATCTC GGTTTGACTA CTTGCTAAGT CAATTCGCCA GTCTCCAACT 900 

TCTTGGCTGA TAGCAAGAAT ATCCGTGTTC ATTTGCAGGG CAGTGAAGAG CATCAGTGGA 960 

TTTTTATTGG TCTCTGTTTT GCCTTCTTGT CTGAGAGCCA GAGTCCGCTT ATCTACCTGC 1020 

TCAACTGTTA GTTCATCTGC TGCGTGAAGA TAGAGGGAGA AGGGCTTTGC CTTGCTTTTG 1080 

CTTCAAACGA ATAGAAGCAC CATAGCAAGT CGGTGTGAGC TGGGTTTCAA TTTGATAGCG 1140 

CAGGGAAAAG AGCTTCAAAT AGTGAGGCTG GAAGCAAGCT TTATCTATAT CATAAGAAGA 1200 

CTGGCGGTGA AAGAGGCTGT CTCCACCCAG TTGACTGGTG ACAGGTGTCA GAAGGAGCCA 1260 

AGAGTAGTCC CCAATCCAAG GACTGGGCTG GTGAGTTAAT CGAATCCCCT GAAAGATAGG 1320 

CAGATGCGGA TCAAAAAACC AAGATCCATC CTGGTCACTG GTCTGGGGCA CAAAATAATT 1380 

CATCCCAAAA GGCACGCCTG TGTATGGCAG GGTATTTCCC CGAGAGAAGG CATGCTTGCT 1440 

AGCAGTACCA AAGCGGGTAT CGATGGTTTC AAGTAGTGGT TTCATAGTCT TTCCTTTAG 14 99 

(2) INFORMATION FOR SEQ ID NO; 119: 

(i) SEQUENCE CHARACTERISTICS: 

(A) . LENGTH: 864 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 119: 



CTCCATCTGC AATCGGCATC AGACAAGATC TCCAAGTGTT GTTTCACGTA ATCAATACGA 
CCATCATCGT AAACAGTGTT ATTAACGAAC TCATCTTTAT AGCCGAGACC ATTTTCAGTG 

173 
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ATATAAATTT TCTTGTAGTT AGGATAATCT TTCTTCACAC GCATGATTTG GTCATACAAA 180 

CCTTGAGGGT AGATAATCCA ATCCCAATCC GTGCGTGGAT ACATAGTCAG GGAGCTACAA 240 

GACGACCAAT ACCTTTGATT TGATACTTAG AGCTTCCTTT TTCACCTTTA CCATTATGGA 300 

TAATTTCAGT TTCTCCATCA AAGGCTTCCA TCCAGTCACT CATATAGTAG TTGATTCCTA 360 

GGAAATCATT CAAGTCTTTT GCAGCTTCTA ATGCTGTAAA ATCTTCTTCA CGAAGATCCA 420 

AACTACCACC ATTGACTGAT AAGATATGGT TGACACCTTC CATGGTTTCA GCTGAATAGC 480 

GACCTAGATA AGTTCCGTCT AAGATGAATT TATTGTGGAT GATATCTTCC AACTCTGCTG 54 0 

CACGAACATC TGCTGGATTT TCAGGATCTA GAGGATATTT AGTAGGCAGG GCGTGAACAA 600 
CACCAATTTC CCCTTTATAG CCTTTCTCTT TGTACAATTT TACCGCGCAT GCATGAGACA 
CCATCATATT GTGGTGTGAT TGAAAAACTT TGGCAAGGTC GTACTGGATA CCTGGAGGGA 
ATTTCCCAAC CAAATATTGA CCATCACCGA TTGGTCCAAT TTCATTAAAG GTTGTCCAAT 
AGTTTACTTC TGGAAATTCT TCCAAAACAG AAGGCAGCGT TAGTCTACAA AATGTTCGAT 
ATTTTCACCG TTTAAGAAGT CTCC 



660 
720 
780 
840 
S64 
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(2) INFORMATION FOR SEQ ID NO: 120: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1233 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DMA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 120: 

ACGAGCAATN TNTTCGGATC CATTTCTTCT CCTTTGTAAT AAGATAAAGT CCGTTTATCA 

AATTTTATTT AGTGTTAATT TGGTTAAAAC AAGCAAACTA AGCTGCTTTC TTGGTTAGTT 120 

GTGACAAGAT TGCCAAACGA TTTTGACGGA CAGCCTGATC TTCAGCCATT ACCATAGTAT 180 

TTTCAAAGAA AGCATCAATG ACTGGGCTAA GCGCAAAGAG TTGTTTCAAT TGCTGACTTG 240 

CAGGTCCTGA TAAAACGAGT GTTTCTACTG CTTCTGCCAA AGCTTTTTCT TGGTCATTCT 300 

CAAATAGTGC TGAATCAACT GTAGCAACCC CTTCTGCCTT CTCAGCCAGG TTAAAGGCAC 360 

GAGAAAGTGA TTCAACAGAT GGTTTAAAAT CTTCTTCCTT GCTTACTTCT ACGAGAGCAC 420 

TTGCTGCTTC CAACATATCT GCCACAACAA AGTTTGAACC TGCAAGAACT GCTTCCTTGA 480 

TATCTTTTGG AGTAGAGCCC ATCATCTTAT CAACACGAGC CTTGATAAAG TCCATAACCT 540 

CTGCTTTATT TTCATAAGTC AAACTGTCAA ATTTCAATGC ATAAAGGCTA TCAATCAGCT 600 

CCAACTGTAT AACAGAGTCC GCCAG : ACC ATGAGAATCC AGAAAACGGG TGTNGTTTGA 660 

CTGATAATGG CAGGAATGAT AGCCA "- ACC AACCAGCCCA TAATCAGGTA AAGAGCAAGG 720 

CTAAATTTCT CATTGACCTT TTTAGC.iAAG ATTTTATAGA GAATACCAAA GATGGTCGTT 780 

CCCCATTGGA TGACAATAAT CAGATAGCCA AACCAGTTAT TCATCAAGGT CAAGACAACG 840 

GGCGTGTATG AGCCGGCAAT GGCAACGTAA ATCATAGAAT GGTCAATGAT TCGCAAAACA 900 

TATTTGTGGG TCGAACCATA GGCCATAGAG TGATAAATGG TGGATGATAG GAACATGAGA 960 

AAGAGACTGA TGACGAAAAT GGAAACGCCG ATAGAGGATA AAAATCCGTG TGCTTCATAA 1020 
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CTATAGATGG ATGAAATAGG CAGCAAGATA AGCATGATGA CTGCACCCAC AGCATGGGTC 1080 

ACGCTATTAG CAATCTCCTC TCCAAAACTG AGTTGTTTGC TGAGTTTAAG ACTAGTGTTC 1140 

ATTGGATTAC CTCCTCTTGA GTATGATCGA TTAAGTCTAG AGTTTGATGA TAGAGTTTAA 1200 

CGGTTTGGCA GCTGGTT TG G ATAATAGGGT TAG 1233 

(2) INFORMATION FOR SEQ ID NO: 121: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 88B base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:121: 



TACTATTAAC 


CTTCCTGTGA 


CTAAAGATCT 


CCACCTAAAC 


CGTTCGCGCT 


CAGCTCAATC 


60 


TTCGTATCAA 


ATCCAGTACA 


AACAAGGGGC 


TCTTATCTGC 


CCAACTGCTA 


GAATTTGGGC 


120 


AAAAGAAATA 


CCTACAACCT 


TATCCAGCTA 


TTTTAAGTGC 


TAGAACCATT 


GATAACGGTC 


180 


GCTATCACAT 


GTTGGAAAAT 


CTCTGTGAAT 


TGCCATTTAG 


ACCAGAGGCA 


CAACGAGTCG 


240 


TGACAAAAGG 


TTACCTTAAT 


TTACAAAATA 


GAAATGATTT 


ACTGTTAGTA 


GAGGATATTA 


300 


CTGCAGATGA 


ATGGATGGAC 


GTTCAATTTG 


AACTGCAGCC 


AACTATTTAC 


AAGCTAAAAG 


360 


AAGGAGACAC 


TCTCCGTTTA 


GTCCTCTATA 


CTACTGACTT 


TGAAATCACC 


ATCCGTGACA 


420 


ATACAGACTA 


CCACTTGACT 


GTCGACCTCG 


CTCAGTCCAT 


GCTTACCTTA 


CCTTGCTAAA 


480 


AGGAGTTACA 


TTATTATGAA 


TAAATCAGAA 


CACCGCCACC 


AACTTATACG 


CGCTCTTATC 


540 


ACAAAAAACA 


AGATTCATAC 


ACAGGCTGAG 


TTGCAAGCCC 


TTCTTGCTGA 


GAACGACATT 


600 


CAAGTAACCC 


AGGCAACCCT 


CTCACGCGAC 


ATCAAAAATA 


TGAACCTATC 


AAAAGTCCGC 


660 


GAAGAAGATA 


GCGCTTATTA 


TGTTCTTAAC 


AATGGTTCCA 


TCTCAAAATG 


GGAAAAACGT 


720 


CTCGAACTCT 


ACATGGAAGA 


CGCCCTTGTC 


TGGATGCGCC 


CAGTTCAACA 


CCAAGTCCTA 


780 


CTAAAAACCC 


TTCCTGGACT 


GGCTCAATCC 


TTTGGTTCTA 


TCATTGATGA 


CTTTGAGCTT 


840 


CCCTGACGCT 


ATCGCTACCC 


TTTGTGGTAA 


TGATGTCTGT 


CTTATCAT 




888 



(2) INFORMATION FOR SEQ ID NO: 122: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1524 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

175 



WO 97/43303 



PCT/DS97/079S0 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 122 : 

CTGTGATTGT CTTTTTGGTC TATCTGATCA TTACTGTACA AAAGCTTGGT CGTCTCATCG 
ATGAAACAGA AAAGACGATT AAAACCTTGA CTTCAGATGT GGATGTGACC TTGCATCACA 
CCAATGAGTT GTTGGCTAAG GTCAATGTCT TGGCAGATGA TATCAATGTC AAGGTGGCTA 
CGATTGATCC ACTCTTCAGT GCTGTTGCAG ATTTATCTCT ATCTGTTTCA GACCTCAATG 
ACCATGCGCG TGTCTTGAGC AAGAAAGCTT CATCAGCTGG TTCAAAAACA CTCAAGACTG 
GTGCAAGTCT GTCAGCTCTT CGTCTTGCAA GTAAATTTTT CAAAAAATAA AAAAGGAGAA 
TCCTTATGGG TAAATTATCC TCAATCCTTT TAGGAACCGT TTCAGGTGCA GCTCTTGCCT 
TGTTTTTAAC AAGTGATAAG GGCAAACAAG TTTGCAGTCA GGCTCAAGAT TTTTTAGATG 
ATTTGAGAGA AGATCCGGAG TATGCCAAGG AGCAAGTCTG TGAAAAACAG ACAGAAGTTA 
AGGAGCAGGC TACAGATTTT GTTCTGAATA ACAAAAGAAC AGGTTGAGTC AGGTGAAATC 
ACTGTGGACA GTATACTTGC TCAAGGCTAA ATCCTATGCT TTTCAAGCGA CAGAAGCATC 
AAAAAATCAA TTAAATAATC TCAAGGAACA ATGGCAAGAA AAAGQTGCAG GAATTCGATC 
TCTTGATGAC TCAGAAGAGA TTGTGATTGA TATAACAGAA GAATAAACCA TCACCATCTC 
CGGACGGACT AAGTATCTGG GGATGGTGAT TTTTATCACG AATCTAGTCT TTGTGGCATA 
ATAATTACTA TGCAGAAAAA ACCAACGTCA GCCTATGTGC ACATCCCATT TAGTACCCAG 
ATCTGATAAC ATTGTGATTT TTCAAAGGTC TTCATCAAAA ATCAGCCAGT CGACAGCTAT 
TTAGAGCATC TGCTGGAAGA GTTTCGTTCT TATGATATTG AAAAGTTGTC AACCCTTTAT 
ATCGGTGCTG GAACACCGAC AGCCCTGTCG GCTCCGCAAC TGGAGGTGTT ACTGAATGGC 
TTGACTAAAA ACTTGGATTT GTCTGTCTTG GAAGAGTTGA CCATTGAAGC CAATCCAGGC 
GATTTGGATG CGGATAAGAT AGCTGTTTTG AAAAATTCGG CTGTCAATCG TGTTTCGCTA 
GGTGTCCAGA CCTTTGATGA TAAGATGTTG AAAAAGATTG GGCGCAGTCA TTTGGAGAAG 
GATATTTATG AAAATATCGA TCGCCTGAAA CTGGCTGGTT TTGACAACAT CTCCATTGAT 
TTGATTTATG CTCTGCCTGG TCAGACCATG GAGCAAGTAA AGGAAAATGT GGCTAAAGCC 
ATTGGATTGG ATATTCCCCA CATGAGTTTG TATAGTTTGA TTTTAGAAAA CCATACGGTC 
TTTATGAACC GGATGCGACG TGGGAAATTG CCTCTGCCTA AGGAGGAACT AGAAGCGGAG 
ATGTTTGAGT ACATCATTGC AGAG 

(2) INFORMATION FOR SEQ ID NO: 123 : 

(i) SEQUENCE CHARACTERISTICS: 
(A) -LENGTH: 654 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:123: 

CTAAGTGACG ACCAAGCGTA GCCTCGCTAA TCACCAATTC TTGAGCCAGC TGATGGGCTA 
AAAACTGTTG GTGGTAGAGA AGATAAACCA AAATCTGGTA TTTAACAGCA CTATCCAAAA 
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TAGACAGACG 


CAGATTTTCA 


180 
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CCTTGTCATT GAGCAGGGTG 


240 


ACATATTTw 
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CCTGTTTCTT 


CCATTACAGC 


CTTGACGGTC 


300 


GTCTGAGACT 


CTTGTAATAG 


AAAGGAAAGG 


ATTGAAAATT 


GACCACACTC 


GGCTTTCTCC 


360 


ATCAAATCTC 


CTAAATACAT 


TTGTTATCCC 


TTTCATTTTC 


TACCTTAAGC 


ATAGCATAAA 


420 


TCTGACAAAT 


GCTAAAATAA 


TCTGTTTCTC 


TTTTTATTTT 


CATGCTGATT 


TCCTGGTCCA 


480 


TTATCCTGAA 


AATCAGTAAA 


CACACGGCTC 


CCCCTTTGGG 


CATTTTTATG 


CTAAAATAGT 


540 


AGCTATGGAT 


AAAATTATTA 


AAACTATATC 


AGAAAGCGGA 


GCCTTTCGTG 


CTTTTGTCCT 


600 


TGATAGCACG 


GAAACCGTCC 


GCACTGCTCA 


AGAAAAACAT 


CAAACCCAAG 


CTAG 
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(2) INFORMATION FOR SEQ ID NO: 124: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1181 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 124: 



CTTGAACATA 


TTCATCAAAA 


GAACTATGAA 


CAAAGATTAC 


ATCCGGGAAG 


AGTTCATTTG 


60 


CAACACGGAA 


TGGCTTCGCT 


CAGATTTTTC 


TTGGGACAGG 


CTGGTGGTCA 


CAGCCGTTTC 


120 


ATCATCAACC 


ATTGGTTGGT 


CGAAGCGTAG 


GCAACGCACA 


TCTGACAATT 


TTTGTTTCCA 


180 


GAAGGTCAAG 


GCCTCATCTT 


TGTCAACTGG 


AATTTCATTC 


CCGTTACTGT 


ACCAGTTGGC 


240 


AAAGAGAATA 


CCTAAAACAC 


GACTACCATC 


CACACCCTGC 


CAGTACATTT 


CTGAAAACTG 


300 


AGACGTAAAC 


TGCTCATCTT 


CAAGGACTTG 


GTTGTCAAAT 


CCAATCGGCT 


TCACACCACG 


360 


ACCAAAGGCC 


GCCACGTGAA 


TGCCTGATTT 


TTGAAGAATT 


TGAGGCGCTT 


GTCCCATATT 


420 


TCCAAAGGTA 


TCTGGAAAGT 


AGCCAATCTG 


GGTTGATTTA 


CCCCATTTGG 


CAGCTTCTTG 


480 


TTGACCAATC 


AAGGTATTGC 


GGACATTGGC 


TTCACTGGAG 


ATCAAATAAT 


CATCCTGCAA 


540 


GATGTAAAAG 


GGACCAATTT 


TAAGCTTGCC 


CTCGTCAATG 


TAGCGTTGGA 


CCTTGTCGCG 


600 


ATTTTCAGGG 


CGAATTTGTA 


AGTAGTCGTC 


AAGGACAATA 


GTTTGTCCAT 


CCAAGTGGAA 


660 


ACTCTTGAAC 


TCAGGGTCAT 


TTTCAAAGAG 


ATCAAAGAGA 


TTGTCAAACA ATTCCACCAA 


720 


TTGCATACGA 


TGGCTTTCAA 


AAGGCAAGTA 


CCACTCACGA 


TCCCAGTGGC 


TATGTGAGAT 


780 


AATATGTACA 


ACAACATTTT 


CCATGAAGTA 


AAACCTCATT 


CTAAATTTAA 


ATTTTAACGT 


840 


TTTAAATGTA 


AACTTGTGAT 


TCTGATAAGA 


ATCGGTTGAG 


CTAAAGCGAG 


CCCCTTAGCG 


900 


AATATCCAAG 


TAATCCAAGA 


CCAACTCACA 


GAACATCATG 


TTAGCCCAGG 


AGAACCATTC 


960 


ACGAGAGTAG 


AGGGTCGGAT 


CATCTACATG 


GAAGCTTTCG 


TGCATGACAC 


CTGTACCACC 


1020 


ATCGCAGGCA 


ACCAGCTGAT 


CCAGCAAGAA 


TTTTTTCTCT 


GCCTTATCTC 


TTGTTGTCAA 


1080 


TCCCTGGATA 


AAAAGGCGAA 


TGGGCCAGAT 


ATAGCGATAG 


AAGTATGAGA 


ACTTCCGAGA 


1140 


CCGCTAGCGT 


ATCTCCTTGG 


TAGAAGTATG 


GATTTTCAGA 


A 




1181 
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(2) INFORMATION FOR SBQ ID NO: 125: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 830 base pairs 

(B) TYPE: nucleic acid 

(C) STRANOEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic UNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 125: 



60 
120 
180 



CTCTGACGGC GTCGCCACTT AAGAAGAGTA TCAAAAAGAA AAATAGAAAA TTAACTAACA 
AGGNAGAAAA ACACATGTCT AAAATTATCG GTATTGACTT AGGTACAACA AACTCAGCAG 
TTGCAGTTCT TGAAGGAACT GAAAGCAAAA TCATCGCAAA CCCAGAAGGA AACCGCACAA 

CTC CATC TGT AGTCTCATTC AAAAACGGAG AAATTATCGT TGGTGATGCT GCAAAACGTC 240 

AAGCAGTCAC AAACCCAGAT ACAGTTATCT CTATCAAATC TAAGATGGGA ACTTCTGAAA 300 

AAGTTTCTGC AAATGGAAAA GAATACACTC CCACAAGAAA TCTCAGCTAT GATCCTTCAA 360 

TACTTGAAAG GCTACGCTGA AGACTACCTT GGTGAGAAAG TAACCAAAGC TGTTATCACA 420 

GTTCCGGCTT ACTTCAACGA CGCTCAACGT CAAGCAACAA AAGACGCTGG TAAAATTGCT 480 

GGTCTTGAAG TANAACGTAT TGTTAACGAA CCAACTGCAG CAGCTCTTGC TTATGGTTTG 540 

GACAAGACTG ACAAAGAAGA AAAAATCTTG GTATTTGACC TTGGTGGTGG TACATTCGAC 600 

GTCTCTATCC TTGAATTGGG TGACGGTGTC TTCGACGTAT TGTCAACTGC AGGGGACAAC 660 

AAACTTGGTG GTGACGACTT TGACCAAAAA ATCATTGACC ACTTGGTAGC AGAATTCAAG 720 

AAAGAAAACG GTATCGACTT GTCTACTGAC AAGATGGCAA TGCAACGTTT GAAAGATGCG 780 

GCTGAAAAAG CGAAGAAAGA CCTTTCTGGT GTAACTTCAA CACAAATCAG 830 

(2) INFORMATION FOR SEQ ID NO:126: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1153 base pairs 

(B) TYPE: nucleic acid 

(C) . STRANDEONESS : single 

(D) TOPOLOGY: linear 

(ii> MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 126: 

CTGGGAAAGG AGATATGTGA CAATAATAAA CTGTATTCGT TGATAGAATT TAGAAATAAA 
ATATATGAGA ATTAGAACTT TCCAGAAGTG ATTTAGCGAT TTTACTATGT GCCANGCTTA 
TCGCCTCTAT CGGATTAAAT ATGGATTCGA CTCCCGTGAT TATNGGAGCC ATGTTAATCT 
CTCCTTTGAT GACACCTATT CTGGGAGTGG GGCTCTCTCT AGCTATATTT GATTTTAAAT 
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TGTTAAGAAA ATCTTTTAAA ATATTAGCTA TTCAAATTCT TGCCAGTTTA ATAGCTTCAA 300 

CACTTTATTT TTATCTTTCT CCCATTTCGT ATGCTAGTTC GGAAATTGTT GCTAGAACCT 360 

CTCCGACTAT TTGGGATGTT CTCATTGCTT TTGTAGGAGG GATAGCAGGT ATTATTGGTG 420 

CTAGGAAAAA AGAGACCAAT AATATTGTTC CTGGTGTTGC TATTGCAACC GCCTTGATGC 480 

CTCCTCTTTG TACAGTAGGT TATGCTATTG CTTCTGCTAA TCTAAAATTT ATCATAGGCT 540 

CCTCTTACCT ATTCCTCATC AATTGTAGCT TTATTGTCAT TGCGACTTAT ATAGGTGTTA 600 

GGTTGATGAT GGTTAAGAAA CATTATTTTA AAGATAATGA AGAAGACTCT AAAATGCGTA 660 

GGATTTTGCT TCTAGTTGCT GTTTTGCTGA TGATTCCGAG TTTCATCTCT GCAACGACTT 720 

TAGTGAGAGA AACGTTGAAA AAAGAGTCCC TTAAGAAATT TATATCAGAG CAGTTTCAGG 780 

GGCATAATAT TTTGAAAAAA ACCTATTCTA AAAAGACTCA TACCCTAAAG CTAACCATTT 840 

CAGGAAATTA TTTGACAGAA GAAGAACTCG ATATGATTTC AAGTAAGAGA GGTGACTATG 900 

GTTTAAGTGA TGTTTCTGTT CAAGTTTCAC AATTGTCTGA TTCAGAACAA CTTAGTAAGG 960 

AAGAACTGGT GGAGTATTTC TTCCAGTATA TCAAGGATAA GGAAGCAAAA GAAAAGGAAA 1020 

AAGCTAATAA GTTTTATACA GAGTCTGAGG AGCAATAATT TCTTGAGAAT AGCTG GTT TT 1080 

TCTCGTGAGT CTTCTATGTA TATCAAAGGA AGACTGAGGT TTTAAGTATG AAACTTTTCT 1140 

TCTATTATAG TAG 1153 

(2) INFORMATION FOR SEQ ID NO:127: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1058 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic UNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:127: 

CTGCAATGGT TAGCGAGGAT GGTGAAATAG AGGAGCCTTA ACTATTTTGA GGAATCCAAG 60 

CTGACAGGAC TGGTATGTCT GCTCTTTGGC TCTGAAGGCA GACTATACCA GCTCGAAGTT 120 

TGACAGATGT GGCACGTGAG AATTATTGGA GGCNTTGACN TANTCAGAGG ATTCTCTCTT 180 

GATATTGGGT TTGGGAGAGT TGGATAGTGA GTTGGAAAAT TACCAAGCGG TTATTCAAGC 240 

CTATGCCCAG TTAGATAATC GCTCGATTTA TGAGCAAACG GGCATTTCCA CCTATCAACG 300 

AATTGGCTTT GCCTATGCTC AGTTAGGGAA ATTTGAAACG GCTACTGAGT TTTTAGAAAA 360 

AGCCCTGGAG TTAGAATACG ATGACTTAAC AGCTTTTGAG TTGGCCAGTC TTGATTTTGA 420 

TCAAGAAGAA TATCAAAAAG CCACCCTCTA CTTTAAGCAG CTTGATACCA TTTCTCCTGA 480 

CTTTGAAGGC TATGAGTATG GGTACAGTCA GGCTTTACAT AAGGAACATC AAGTTCAAGA 540 

AGCCCTGCGT ATCGCTAAGC AAGGATTAGA GAAAAATCCC TTTGAAACTC GCCTCTTGCT 600 

AGCTGCTTCA CAATTTTCTT ATGAATTGCA TGATGCTAGT GGTGCAGAAA ATTATCTCCT 660 

TACTGCAAAA GAAGACGCTG AGGATACAGA AGAAATCTTG CTTCGTTTAG CCACTATTTA 720 

TCTGGAGCAG GAGCGTTATG AGGATATTCT AGACTTGCAG AGTGAGGAGC CAGAAAATCT 780 

TTTGACCAAG TGGATGATTG CTCGTTCTTA TCAAGAAATG GACGATTTGG ATACTGCTTA 840 
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TGAGCATTAT CAAGAGTTGA CAGGAGATTT GAAGGACAAT CCAGAATTTC TGGAACACTA 900 

TATCTATCTC TTGCGTGAAT TGGGACATTT TGAAGAAGCA AAAGTCCATG CTCACACTTA 960 

CTTAAAACTG GTTCCAGATG ATGTGCAAAT GCAAGAACTG TTTGAGAGAT TGTAAGAATG 1020 

TTTAAACATA TAGAACTGTA GTTTATCTCT TTTGATAG 1058 

(2) INFORMATION FOR SEQ ID NO: 128: 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1211 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DMA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 128: 

CTTGTTGCTT GTCCAATTGG TGTAAATTTG GAACCGATTG ACCCATCTGC AAAGAATGCT 60 

AGAGGAAACA CAGGAAATTG TTGCAGGTCG TGTTGCTAGT GTTGAAACAT TGAAGCGAAT 120 

AGAGGAGTTA GGTTTTGACT TTGTCTGTTT GACTGGAAAT CCTGGAACAG GAGTTAGCAA 180 

TCGAGAAATC ATTAAGGCTG TTCAAACTGC TAAGGAAAAC TTTTCTGGTT TGATTATTGC 240 

AGGTAAGATG CACGGGGCAG GAGTGAATGA GCCTGTGGCA GAGCTTTCTG TCGCAGAGCA 300 

ATTGTTGGAA GCAGGTGCGG ATGTGATACT TGTTCCAGCA GTTGGAACCG TTCCAGCCTT 360 

TCATGACCAA GAGTTGCGTG AAGTCGTTGA TCTCGTTCAT AGTAAGGGTG GTCTTGTATT 420 

GAGTGCTATT GGTACTAGCC AAGAAACATC TGATACAGAT ACTATTAAGG AAATTGCACT 480 

TAGAAATAAA ATTTGTGGAG TTGATATTCA ACATATAGGT GACGCAGGAT ATGGGGGACT 540 

GGCAACAGTC GATAATATTT ATGCATTGAG CAAGGCAATT AGAGGAGTGA GACATACAGT 600 

ATCTCGCTTG GCTAGGTCAG TAAATAGGTG ATAAAGGAGT AAGCTATGGC TAAAGTAACA 660 

ATTATGTTAG CATGTGCAGC AGGTATGAGT ACAAGTCTGC TAGTGACAAA GATGCAAAAG 720 

GCAGCAGAAG ATAAGGGGTT GGATGCAGAA ATTTTTGCAG TTCCAGCTCC TGAAGCAGAA 780 

GAAATTGTAG CAACAAAAGA AGTAAATGTG TTGCTTTTAG GTCCTCAAGT TCGCTATTTA 840 

CTAGGGGACT TTCAAGAAAA ACTAAAAGAT AGACAGATTC CTGTGGCGGT TATTCCGATG 900 

ACAGATTATG GAATGATGAA TGGTTCTAAA GTCTTAGATT TAGCTGAAAG TTTATTAGAC 960 

TAAGATTGAG GAGAATGTTA TGGATGAAAG TAATTTAGAA TCTGTAATGG GGCTAATTAT 1020 

GTATGGTGGG GAAGCCAAAA GTAATGCTAT GGAGGCTATT CAGGCAGCAA AAAAAGGGGA 1080 

TTTCTCTAAA GCCAATCGAA GATTAGCTGA TGCGAATGCT GCCTTATTAC AGGCGCATAA 1140 

GGCTCAAACA GAAATGTTGA CAAGAGAGGC ACAGGGGGAA GAAACATCAA TTAGCCTCTT 1200 
GATGGTACAC G 

1211 

(2) INFORMATION FOR SEQ ID NO: 12 9: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 858 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 129: 



CTTCAGCCAT 


CTTTAGTAAA 


GACTTCCTGT 


GTTGACAATT 


GGTTGGGCAT 


CATGATTGCC 


fin 


ACAAAGAACG 


ACAAGGAGAT 


TTGAAACCAT 


GGCAGCTTTT 


CGTTCTTCGT 


CAAGTTCTAC 


120 


CAATTCCCCT 


TCATTGAGCC 


GTTCTAGTGC 


CATTTCAACC 


ATTCCTACAG 


CACCATCTAC 


ieo 


AATCATCTTC 


CGTGCATCAA 


TAATGGCAGA 


TGCTTGTTGG 


CGTTGAAGCA TAACGGCAGC 


240 


AATTTCTGGA 


GCATAAGCTT 


AGGTAAGTGA 


TACGTGCTTC 


AAGGATTTCC 


AAGCCAGCAT 


300 


CTTCTACGCG 


ACTTTGAATT 


TCCTCACGAA 


TACGGTTAGC 


AACAATTTCG 


CTAGAGCCAC 


360 


GGAGACTACC 


TTCATCTGCT 


TGCCCGTCAC 


CCGTAGTATC 


CACATTAGGA 


GACACATCGT 


420 


AAGGATAGAT 


GCGGACAATA 


TTACGGAGGG 


CGCTATCACA 


TTGCAATGAA 


AGATATTCTT 


480 


TGTAGTTATC 


AACGTTGAAG 


ACTGCCTTAG 


CGGTATCGAC 


AACTCTCCAA GTTACCGCGA 


540 


TACCGATTTC 


TACAGGGTTT 


CCTAAGCAAT 


CATTGATTTT 


TTGACGAGAA 


TTGCTCAAGG 


€00 


TCATGACTTT 


GAGGGAAATG 


TGTTTCTTGC 


CAATTTCAAG 


ATTTACATCA 


TTGTCATTTG 


660 


ATGATTTAGC 


TCCTAAAAAA 


GGAGATTTTG 


TGCTAACATC 


ACCACTTTGT 


CCAAGTCGAA 


720 


TGTGGTTTGC 


AGGGTTGACT 


GCTACGCTGA 


AGGGATTGAC 


AAAGTAAAAA CCAGGTTCTT 


780 


TCATGGTACC 


TGTATAATTA 


CCAAAGAGTG 


TCAGAACCAG 


AGCTTCCTGA 


GGTTTGACAA 


840 


CTTTTAAACC 


AGCATGAG 










858 



(2) INFORMATION FOR SEQ ID NO: 130: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 723 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 130: 

TTACTTCGCT AAACAATCAA AGGAACAGTC AAAAATTCAG TTGGAGCTGG TGATTCTATG 60 

GTTGCTGGAT TCACAGGTGA ATTTGTCAAA TCAAAAGACG CAGTAGAAGC CTTCAAATGG 120 

GGAGTGGCTT GCGGAACGGC AACTACCTTC TCAGATGACT TGGCAACGGC GGAATTTATT 180 

AAAGAAACAT ATGGAAAAGT TGAGGTAGAA AAACGATGAA AATTCAAGAT TTATTGAGAA 240 

AAGATGTCAT GTTGCTGGAT TTGCAGGCAA CTGAAAAGAC TGCTGTCATT GAAGAGATGA 300 

TTAAAAGCCT AACAGGCCAC GGTTATGTGA CAGATTTTGA AACCTTTAAA GAAGGCATTT 360 

TGGCGCGTGA AGCTTTAACT TCTACTGGTT TGGGTGATGG AATCGCTATG CCTCACAGCA 420 
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(2) INFORMATION FOR SEQ ID NO: 132: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 864 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

182 



480 
540 
600 
660 
720 
723 



WO 97/43303 

PCT/US97/07950 

AAAACGCTGC TGTCAAAGAA GCGACAGTTC TCTTTGCTAA GTCAAACAAG GGTGTTGACT 
ATGACAGTTT GGATGGGCAG GCAACTGACC TCTTCTTCAT GATTGCAGCT CCAGAAGGTC 
CCAATGATAC TCACTTGGCA GCATTGACAG AATTGTCTCA ATACTTGATG AAAGACGGTT 
TTGCAGACAA ACTTCGTCAA GCAACATCTG CAGACCAAGT TATCGAACTT TTTGACCAAG 
CTTCAGAAAA AACTGAGGAA CTTGTTCAAG CACCTGCTAA TGACTCTGGT GACTTTATCG 
TAG 

(2) INFORMATION FOR SEQ ID NO: 131: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 882 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 131: 

AATCTCGTTG ATTTAGACTT TAACCGTGTC ATCTATTTTA GGCGCTGGTC CTTTTCTTTG 
GACTTCCTCA TGAAGCTCAG CTCAAGATTT TGGAATTAAC TGCTGGTCAA GTTGCGACCA 
TGTATGAAAG CCCAGTTGGC TTCCGTCACG GTCCAAAATC TCTTATCAAC GACAATACAG 
TTVrVTTGGT CTTTGGTACA ACGACGGACT ACACTCGTAA GTACGACTTG GACTTGGTTC 
GTGAAGTTGC TGGTGACCAG ATTGCTCGTC GTGTTGTGCT TTTGAGTGAT CAAGCTTTTG ^ 
GTCTTGAAAA TGTCAAAGAA GTGGCCCTTG GTTGTGGCGG TGTCTTGAAT GATATTTACC 360 
GTGTCTTCCC TTACATCGTT TATGCCCAAC TCTTTGCCCT ATTGACTTCA CTCAAGGTAG 420 
AAAATAAACC AGATACACCG TCTCCTACAG GTACAGTAAA CCGTGTAGTA CAAGGTGTCA 480 
TAATTCACGA ATATCAAAAG TAAGACAGTG TTTATGAATT CTTGACAAGA GGATTTGTAA 
ATTATCAGAT AAACCATAGA TTGTCAGTAC GCTTTCTATG GTTTGTTTGC TTGAGAGAAA 
TAGTAAAAGG AGAACAGAAT GAAAGCATAC ACAGAGCGTG TATTTGGAAA TGTTGAGGGT 
GAGGATGTCT TGGCCTATCG ATTTGAGACA GACGGTGGCT ACCAACTTGA GGTTATGACT 
TATGGTGCGA CTATCTTGCG CTATGTCGCA CCTGACAAGG CTGGAAATTT TGCCAATGTT 
ATCTTGGGAT TTGATGACTT TGATAGTTAT GTAGGCAATA GTCCCAAGCA TGGAGCAAGT 
GTAGGTCCTG TAGCGGGTCG TATTGCAGGT GCGACCTTTG AG 
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(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 132: 



CTCAAGGGCT TGAAATCATC CAGATAAATG AAGAACAAGG CCAATTTGGT TTTGACCGCG 60 

TTAAGGAATT GGTTGATACT TACAAGGTGG AACGATTTGT AGTGGGCTTG CCTAAAAACA 120 

TGAACAATAC AAGTGGACCG CGCGTAGAAG CTAGTCAAGC CTACGGAGCA AAGCTAGAAG 180 

AGTTTTTTGG TTTACCAGTA GACTATCAGG ATGAACGCTT GACAACAGTG GCTGCTGAGC 240 

GCATGTTGAT TGAACAAGCA GATATCAGTC GCAATAAGCG CAAGAAAGTC ATTGATAAGT 300 

TAGCAGCTTC AGCTGATTTT ACAAAATTAT TTAGATAGAA AATTTTAATA TAAAGGAGAG 360 

GCTATGTCAC ACGATCATAA CCACGACCAC GAAGAACGTG AACTAATCAC ACTAGTAGAT 420 

GAGCAAGGAA ATGAAACCTT GTTTGAAATC CTTTTGACAA TTGATGGAAA AGAAGAATTT 480 

GGTAAAAACT ATGTTCTTCT AGTACCAGTT AACGCAGAAG AAGACGAAGA CGGACAAGTT 540 

GAAATCCAAG CTTACTCATT CATCGAAAAC GAAGATGGAA CAGAAGGCGA ATTGCAACCA 600 

ATCCCAGAAG ACTCAGAAGA CGAATGGAAC ATGATTGAAG AAGTCTTCAA CAGCTTTATG 660 

GAGGAGTAAA AACGTCCGGG GGACGTTTTT AGCCCTGCGC TAGAAATTAG AAACGCAGGT 720 

AGTCTGAGAG ACTGTATCAG CCCAACTCCC AGAAATGGGG AAGAGTTGGA ACGTCCAGTG 780 

AACATTTTTA CCCCAACTCC TAGAAATTGA GAAGAGTTGG AACATCTAGT GGATATTTTT 840 

AGCCCACGTT CACATTCATA GTAG 864 

(2) INFORMATION FOR SEQ ID NO: 133: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 952 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 133: 



GTATCGTTCA ATTCCAAAGA . TACGGAGGAA GCTGCGTTTC GCGAGAAAAC TCTTGTTAGA 60 

CCAGATAGGG ATTAGCCAAA ATCCTTCTAT GNGCTTATAT CCAAATGGAG ATGCAAACTT 120 

CTGCCCAACT CTTGACCAAT AAGATCTTTC TTAAAAATCC ACTCAAGGCT CTGGTAGAGG 180 

AAAAATATGG GATTGAGTAT GAAGAATTTA CCAATCCTTG GCACGCTGCC ATTTCTAGCT 240 

TCGTTGCCTT TTTCCTTAGA AGTTTGCCTC CAATGCTGTC AGTGACCATA TTCCCAAGTG 300 

AATATCGCAT CCCTGCTACT GTCCTTATTG TCGGTGTGGC CCTTCTTCTC ACTGGTTACA 360 

CTAGTGCCAG ACTTGGAAAG GATCCGACTA GAACAGTATG ATTCGGAACC TTGCTATTGG 420 

TCTCTTGACC ATGGGAGTTA CCTTCCTGCT CGAACAACTT TTCAGCATTT AGAATACAAG 480 

AAATACCTCG ATTTTGAAGT CGAGGTATCT TTTTTACATT TGCACAATCT TGCGATAACT 540 

TCTTGAAGTA ATCATGAAAA TCAGCACATA GGCGATGAGG AAGATAGCGC AGATAGACAA 600 

GGTCACAATC AACATCATAG TCGTATCCAG TACACCAATC ACTTTTAAAA TCAGGCTAAG 660 
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CATATGGTAG GCAAAGGCGA GATGTATGAA GGCAAAGAGC AAAGGAAGGA AGAAAACAGT 720 

TAAAACCTGT TTGTTGATGG TTTGCTTGAT TTGCTTTTGG TCCAAACCGA CTTTCTGCAA 780 

GATAATAAAG CGTTCACGGT CTTCGTAGCC TTCAGAAATT TGTTTGTAGT AGATGACCAG 840 

AACAGTTCCG ACCATAAAGA TAATGGATAG GAAAATACCG ATAAAGAAGA CACCGCCAAA 900 
GAGGACACTC ATTTGAGCAC TAGCATCTGC TAGATTGCTA CCATAAACAT AG 



(2) INFORMATION FOR SEQ ID NO: 134 : 



952 



(i) SEQUENCE CHARACTERISTICS: 
. (A) LENGTH: 1133 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: Genomic DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 134: 

ACAGAATTAC TCACTATTCT GCGTCTTCTG CTAAAAGTTT AAACTCGTGT TTCTTCATCC 60 

AAAAGTGGGT AACCTAGGCG TTTGCGTTCT GCAACAAAGG TTTTGGCTAC AACACGGGCC 120 

AAGTTACGGA TACGGAGCGA TATAGCCTGC ACGGCTCTGT TACAGATACG GCACCACGCG 180 

CGTCAAGCAG ATTAAAGGTA TGTGAACATT TGAGAACATA GTCATAGGCA GGGTGTACCA 240 

AGCCTTCTTC TAATGCACGA CCAGCTTCTT TTTCAAACTT ATCAAAGTTT TCAAGCAACA 300 

TTTCTTGGTT CGAAATTTCA AATGAATATT TTGAGTGCTC ATACTCAGGC TGGATAAAGA 360 

TTTCTCCGTA TTTTACACCA TCAGCCCACT CGATATCATA GACAGAGTCT ACTTCTTGAA 420 

TGTAAGAAGC CAAGCGCTCC AAACCATAGG TAACTTCCGC AGTCACAGGG CCAGTTGCCA 480 

ATCCACCGAC TTGTTGGAAA TAAGTGAACT GAGTGATTTC CATTCCGTCA AGCCAAACTT 540 

CCCAACCAAG ACCAGCTGAA CCAGTTGATG GGTTTTCCCA GTTGTCCTCA ACAAAACGAA 600 

TATCGTGCTC CAAAGGATTG ATTCCCAATT TTTCCAAAGA CTCAAGGTAA AGTTCTTGGA 660 

TATTTGATGG AGAAGGCTTC ATGACCACCT GGAATTGGTG GTGTTGGTAG AGACGGTTAG 720 

GGTTTTCCCC ATAACGACCG TCAGCAGGAC GACGTGATGG CTCTACATAA GCTGCATTCC 780 

ATGGCATCAG GTCCGATAGG CACGAAGGAA AGTGTAAGGA CTCATTGTTC CCGCACCTTT 840 

TTCATTATCA TAAGCCTGCA TAAGCATACA ACCTTGGTCA TTCCAAAATT GTTGCAAAGT 900 

CAAAATAATT TCTTGAAATG TTAATTCCTT AGACATTGAA TACTCCTTAA TAAAAAATAA 960 

TTTCTTGCGA CACGAGTTCT GCCTCGTCGA TTATACCNAG GCACANAAAT TANTNCTNGC 1020 

TTCCAAATTC GAATCCCGai NCCCCGGGGG ATTCNNTTTT CNTAAACCGG CCCCCCCCCN 1080 

GNGGGAACCC CCCCNTTTTT NCCNTTATTT AAGGGTTANT CNCCCNCTTG NCN 1133 

(2) INFORMATION FOR SEQ ID NO: 13 5; 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 806 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 5: 

ACCTAATTTT TTCATTAGAT CCTCCAATTT ATTAGGGCTT TGCCCCTTAT TTTAACAAAT 
GTTTATTTTT CAGTTTCAAA TATCGTTGTT TGGGAGCGAT AAAGAAGCTA ATGAGAAAGA 
AACTAGCAGC TGTAAGCACG ATACTAGAAC CTGCCGCAAC ATTAAAACTA TAGCCAATAA 
AGAGTCCCAA AACTGAAGCA GTAGCTCCGA AGGTTGAGGA AAGGAAAATC ATGCTTTTCA 
GACTATTAGC ATACAGATAA GCAGTTGCAG CTCGGGTAAT CAGCATGGCT ACAATCAGGA 
TAGTTCCGAC ACTTTGCATG GCTGTCACAG ACACGAGAGT CAGGAGTACC ATGAGAAGGT 
AGTGATAGAA ATTGACAGGC ATTCCCATGG CTTTAGCCAA GAGTTCATCA AAGGAAGTTA 
TCAAGAGTTG CTTGAAGAAA ATCCAGATTA ACAAGAGGAT GGCTGCCCCC ACACCCATAG 
TAATAAACAT ATCCGTATCT TGGACGGCCA GGATATTACC AAAAAGGATA TGGAAAAGGT 
CAGTTGAACT TTTAGCGACA CCAATCAAGA TGATACCGAG GGCTAAGAAA GAAGAAAAGG 
TAATGCCGAT GGCGGTATCG CTTTTGATAA TCGAGTTTCC TTTGATGTAG GTAATGATGA 
TGGCAGCTAG CAATCCAAAG ACAATGGCTC CGATAAAGAA TCAAGGCCCA AGATGAAGGA 
GAGGGCTACA CCTGGTAAGA CAGCATGTTG AAATGGCATC TCCCATGAAT TGACATCCCG 
CGTAGAATAA TGAAACATCC CACAGA 

(2) INFORMATION FOR SEQ ID NO: 136: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1096 base pairs 
(BJ TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

fii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE. .DESCRIPTION : SEQ ID NO: 13 6: 

CTAACATTTC CTTTGCTACT TCTAATAATG TCATAGTGGT CATTAAATGA TCTTGAGCAT 
GTACCATGAT AATTTCAATT TTAATTTCCA COCCACTTGC GTATTCTTGC AAGAGTTTGG 
TTTGTGCATG ATGCGCTTCA AGAATTATCT CATTTGATTG ATTTAATTTA CTTTCTGCAT 
CATCAAAACT ACCTTCTCTC ATTTTTTGCA AATGCTTCAT GTATTTCTGA CCTTGCATTT 
CCCGAATGCA GGATAATTTC AAATGCTGCA ACCTGCAGTT CCTCTTGATT CATATGAACC 
TCCTATTTTA TCTTCTCAAA TATGTTAATA AAATCTTCAA AGTTATTGCA AGATATTAGC 
TGATTTTGCA ATTCATCATT CTCTGTCAAA GAGACTATCT TTTTAGTCAC AGTTGCCAAA 
CCTTCCGTTC CCATATATTG ATGGAGATAG AAGAAATAGC TAGTCTGGAC ATGTGAACTT 
TGATTATCCC AGAGTAACGA ATCTTTACAA ATTGCAACCG AAACCTTTTC CTCTGTACCA 
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AAGGGCTGAA TAGGATGCGG AACTGCAATT TTTTCAGAAA AAACAACTGA ACTTAATTCT 600 

TCGCGCTGTT TAATTCCATA AAGTAAAGAT TGTTCAAACT CATTTGATTC ACCAACAGAT 660 

AAACTCTCAA CCATCTTTTC AAGTAAATTT ACCTTGTCTG ATTCAGTACA TATTAAAAAG 720 

TTTTCTTTAC TAAAATACTG TCTAAAGCCG TTGTTTTCAA ATTTGTTAAT CTTTGATGAT 780 

TGTACATAAC TAGAAACTTG CATCTGAACC ATAGCTTTTC TAATCATTTC CATCTCATCA 840 

CTCTTAAGAA ACACACTAAC TTTAAAAACT GGGATTTGAA AATATAGATT TGATAAATCA 900 

ACAGCTGACA CTATAAAATC TATTCCTTTA AGTTTTTCTT GATTCAATTC ATAATAGCCT 960 

ATTACATCAA CAACTTCTAC TCGCTTCCCA AACTCCGTTT CCAAACGATT TCTTAACATT 1020 

TGGGCTGCAC CAAATCCTGT TGCACAAATA GCAAGAATAT TAAACTTAGT ACTCTCTTTG 1080 
CTACGTTCCA TAGCAG 



1096 



(2) INFORMATION FOR SEQ ID NO: 137: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1124 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 



(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 137: 



GGGGATTTTG 


GTTGCTGGGG 


TAACACGTGG AGATGTTCAA 


ATGTGAAAAT 


ATCACAGATA 


60 


CTTCAAGCGT 


GTTAAGGACA 


TCCCTTTGAC TTTGCCAGAA 


GAACTAGATA 


TCACAGTTCG 


120 


TGGGGAATGT 


TACATGCCAC 


GCGCTTCCTT TGACCAAGTT 


AACCAAGTGC 


GCCAAGAAAA 


180 


TGGAGAGCCT 


GAATTTGCTA 


ATCCTCGTAA TGCGGCAGCA GGAACTATTG 


TGTCAGTTGG 


240 


ATACAGCAGT 


AGTTTCCAAG 


CGTAATCTTG CAACGTTTCT 


TTATCAAGAA 


GCCAGCCCTT 


300 


CAACTCGTGA 


TAGCCAAGAA 


AAGGGTTTGA AGTACCTAGA 


ACAACTAGGT 


ATTGTGGTCA 


360 


ATCATAAGCG 


AATCTTTGGC 


TGGAAAATAG ATAAAATATG 


GAATTTTATC 


CAAGAAGTAG 


420 


GACAGGAACG 


GGAAAATTTG 


CCTTAAGATA TTGATGGAGT 


GGTAATCAAG 


GTCAACGACC 


480 


TAGCAAGTCA AGAAGAACTT 


GGTTTTACCG TTAAGGCTCC 


AAAGTGGGCA 


GTAGCCTACA 


540 


AGTTCCCTGC 


TGAAGAAAAA 


GAAGCTCAAC TCTTATCAGT 


TGACTGGACA GTTGGCCGTA 


600 


CCGGTGTTGT 


AACTCCAACT 


GCTAATCTAA CACCAGTACA ACTTGCCGGT ACGACTGTTA 


660 


GCCGTGCGAC 


CCTGCACAAT 


GTAGATTATA TTGCTGAAAA 


AGATATCCGA 


AAAGACGATA 


720 


CGGTCATTGT 


ATATAAGGCT 


GGTGACATCA TCCCTGCCGT 


TTTACGTGTG 


GTAGAGTCCA 


780 


AACGGGTTTC 


TGAAGAAAAA 


CTAGATATCC CTACAAACTG 


TCCAAGTTGT AACTCTGACT 


840 


TGTTGCACTT 


TGAAGATGAA 


GTGGCCCTAC GTTGTATCAA 


TCCGCGTTGC 


CCTGCTCAAA 


900 


TCATGGAAGG 


CTTGATTCAC 


TTTGCTTCTC GTGATGCTAT 


GAATATTACA 


GGCCTTGGTC 


960 


CATCTATTGT 


TGAGAAGCTT 


TTTGCTGCTA ATTTAGTCAA 


GGATGTGGCG 


GATATTTATC 


1020 


CGTTTGCAAG 


AAGAAGATTT 


CCTCCTTTTA GAGGGGGTTA AGGAAAATCC 


GCTGCTAAAC 


1080 


TGTATCAGGC 


TATCCAAGCA 


TCAAAGAAAT TCCTGCCGAG 


AAGA 
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(2) INFORMATION FOR SEQ ID NO: 138: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1264 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEONESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 138: 

CTTGGTAGAA CTTGCTAATC AAGCTGGCAA GCCTGTAGTC TTGGACTGCT CAGGTGCAGC 60 

ACTTTCAGGC TGTTCTTGAA TCACCCCATA AACCAACAGT CATCAAACCA AATAATGAAG 120 

AATTGTCTCA GCCTTCTTGG AAGAGAAGTT TCTGAGGATT TGGATGAATT AAAAGAAGTA 180 

CTTCAAGAAA CCTTTGTTTG CAGGGATTGA ATGGATTATC GTTTCACTTG GTGCCAACGG 240 

TACTTTTGCC AAACATGGTG ACACTTTCTA CAAGGTAGAT ATTCCTAGAA TTCAGGTGGT 300 

AAATCCTGTT GGATCTGGAG ACTCTACTGT GGCAGGAATT TCTTCAGGAC TTCTTCACAA 360 

AGAATCGGAT GCAGAATTAC TCATCAAGGC AAATGTCCTT GGTATGCTCA ATGCTCAAGA 420 

AAAAATGACT GGTCATGTCA ACATGGCCAA CTATCAAGTT CTATATGATC AATTAATAGT 480 

AAAAGAGGTA TAAAATGGCT TTAACAGAAC AAAAACGAGC ACGCTTAGAA AAACTTTCTG 540 

ATGAAAATGG TATCATCTCA GCTCTTGCAT TTGACCAACG TGGTGCTTTG AAACGCCTCA 600 

TGGCTCAACA CCAAACAGAA GAACCAACTG TGGCTCAAAT GGAAGAACTG AAAGTCTTGG 660 

TAGCAGATGA ATTGACTAAA TACGCTTCAT CAATGCTTCT TGACCCTGAG TATGGACTTC 720 

CAGCAACTAA AGCTCTTGAT GAAAAAGCTG GTCTTCTCCT TGCTTATGAA AAAACAGGTT 780 

ATGACACAAC AAGTACAAAA CGCTTGCCAG ACTGCTTGGA TGTTTGGTCT GCAAAACGTA 840 

TTAAAGAAGA GGGTGCAGAT GCAGTTAAAT TCTTGCTTTA CTATGATGTA GATAGTTCAG 900 

ACGAACTCAA CCAAGAAAAA CAAGCTTATA TCGGAGCGTA CCGGTTCTGA GTGTGTGGGC 960 

TGAAGATATC CCATTCTTCC TTGAAATCCT TGCTTACGAT GAAAAAATTG CACACGCAGG 1020 

TTCTGTACAA TATGCGAAAG TAAAACCACA CAAAGTGATC GGTGCTATGA AAGTCTTTTC 1080 

ACACCCACGC TTTTAACATT GATGTCTTGG AAAGTTTGAA AGTTCCTGTT AACATTAAAT 1140 

ATGTTGAAGG CTCCCCTGAA AGGTGAAGTG GGTTTACACA CGTTGAAGAA ACCAGCAGCC 1200 

TTCTTCAAAG CGCAAGATGA AGCAACGAAC TTGCCATACA TTTACTTGAG TGCTGGTGTA 1260 



TCAG 



(2) INFORMATION FOR SEQ ID NO: 13 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 597 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

187 
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(ii) MOLECULE TYPE: Genomic DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 139 : 

CTGAACTGGA TTTTGCTACA AGGTCTGATT TGTCAAAGAG ATGCGAGTTA TGATATGAAG 
CAGGATGATT TGGATAAGGT AGCAGATTAT CTCTTCAAAA CAGAAGAATG GACCATGTAT 
GAGTTGATTC TTTTCGGTAA CCTCTATAGT TTCTACGATG TAGACTATGT CACTCGGATT 
GGTAGAGAAG TTATGGAGAG GGAGGAATTT TACCAAGAGA TTAGTCGCCA TAAGAGATTA 
GTGTTGATTT TGGCCCTCAA TTGTTACCAG CATTGTTTAG AGCATTCTTC TTTTTATAAT 
GCCAACTATT TTGAGGCTTA TACAGAGAAG ATTATTGACA AAGGTATTAA GCTTTATGAG 
CGTAATGTTT TCCATTATTT AAAAGGTTTT GCCTTATATC AAAAAGGACA GTGTAAAGAA 
GGCTGTAAGC AGATGCAAGA GACCATGCAT ATTTTTGATG TGTTAGGTCT TCCAGAGCAA 
GTAGCCTATT ATCAGGAACA CTACGAAAAA TTTGTCAAAA GTTAATTTTC CCAAATAAGG 
GAAAAAATAA AAAGCTCCTT TCGGTTTTGA TACAATAGTT TCAAAATTTG AGAGGAG 

(2) INFORMATION FOR SEQ ID NO: 140: 
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(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 623 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

{ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 140: 

CGTGTTCCCA GATGGCCGCA TCCATTTATT GAGCCAGACG TGTCCAAACT CATCAGGGTT 
ATCTTCGATA ACCAAACGAA TACGAGTGCT TCGTCGCCAC CAAGGTCTGG TAGTTGACGA 120 
TGTTCAAAAG AGCCGTTTCG ACCAACTGAC ATTGGGCTAG AGGTCCTTCC ACCTGCACAA 180 
TCGGTTCATT AGCAAAAACC AAATCCCCTT CTTGGGCAGA ACGAACGGTC AACTCCAACT 
TGAAATTGCG AAGGTAATCC AAGAACGCCC CATGATAACC AAGCGACTCC AAATAGGCTA 
TATCACTATC TGAAAAACGC AAGTCTTCAA GATAGTTCAC AATTCTTTCC AAACCTGCAA 
AAACCGCATA GCCGTTCTTA AAAGGCTGTT GGCGGAAATA CACCTCAAAG ACCGCCTTCT 
TATTGTAAAT CCCTTGATCA AAGTAAACCT GCATCATGTT GATCTGGTAC AAGTCCGTGT 
GCAATGTCAA ACTATCATCT GGATACATAC TTTTCCTACT TCCTTAGCTA GAAACCCATG 
AAAATTTTCA AGAACTTTCA TGTATTCCAA TAAATTAGTA CTATTATATC ACATTTTAGC 600 
TGGATTGAGA AAAGAGTAAC AAG 



60 
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(2) INFORMATION FOR SEQ ID NO: 141: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 609 base pairs 
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(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 141 : 

CCATGTTGGT TCTCAACTAT GTTTATGCTG TTCCTTTGTA OGCTAAGTTT GCTAACTTTG 
ATATTGGAAA AATTTTGGGA CTTTCCAACT ACCTAATGAC CATGGTATTA CCTTTTAACT 
TGATTGAAGG TGTTATCTTT TCCGTTTCAT TCTGGTTGTT GTATGTTCTC TTCAAACCAA 
CCTTAAAACA TTATGAGAGA TAAACAAACA TTTTTAATGA AGGGCAGTTT TGCCCTTTTA 
CTTTTCGTTA TTCTTGGCTA CATGGTCAAA TTTTACCCTG AAACGCTGGT CAATTTTGAC 
CAATCGATTC AGACTGCCAT TCGAGGAGAC TTACCAGATT ATTTGACTAT TCTTTTTCGA 
GCCCTCACAC GTCTGATTGA TATCCCAGTG ATTATCACTT GGGTTGTCAT TACAGCTTTT 
GTCTTTTATC GTAAGCGATG GAAGATAGAA AGTTTCTTCA TGCTGGGAAA TCTCGCTTTG 
GCAGGTCTTT TAATCGTGAC CTTTAAAAAT ATCTACCAGC GCCCACGGCC AGATATCTTA 
CATCTGGTGG AAGAGAAGGG ATTTTCCTTC CCAAGTGGGC ATTCTCTGGC TGTAACCTTG 
ATGGTAGGT 

(2) INFORMATION FOR SEQ ID NO: 142: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 638 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 142: 

CTAAAAAAGT TACAAGTGTT GCCATGCCGA GTTACATTGG TTCCAGCTTG GAGTTTACTA 60 

AGAACAATTT GATTCAAATT GTTGGGATTA AGGGAAGCTA ATATAGAAGT TGTAGAAGTG 120 

ACGACAGCGC CTGCAGGTAG TGTAGAAGGC ATGGTTGTTG AACAAAGTCC TAGAGCAGGT 180 

GAAAAGGTAG ACCTAAATAA GACTAGAGTC AAGATTTCAA TCTACAAACC TAAAACAACT 240 

TCAGCTACTC CTTAAAAGCA GATGGATAAT CAAAATGAGT AATCAATCTT TGTCATGATT 300 

TCATGGCAAA GATTTTTTTT GAGTCCAGAT TTGTGATAGA ATAGAGGGAG TTGATAAAAG 360 

GAGGAAAGCA TGGAAGAATC AAAAGAATTA AATGCCGTCA TTGATGTGAT TATGCTAGCG 420 

GGGACTATTC TCCTTAAAAG TGGCTCAGAA ATCCATCGTG TAGAAGATAC CATGATTCGA 480 

ATCGCGCATT CGCAGGGGAT TGTGGATTGC AATGTCCTTG CCATCCCTCC CGCTATCTTT 540 
TTCTCTATTG AAAATACCAA TATTTCGCGC ATGAACGCGT GACCTCCTCT TCTTATAACA 
TCGAAAAAGT CTGCGATGTG AACCAGATTT CTCGTCAG 
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(2) INFORMATION FOR SEQ ID NO: 143: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1450 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 143: 

CTCCATATTT CTTAGCCTTC TCAATTAGGG TCTTGAAGTC TTCGACACCA CCGATACGCT 60 

TACCAATATC AGCATAGTTC AAGTGACCAG AGTCATGGCT GTGATATCCT TAACTTTTTC 120 

CCAACCTTGA GGGTTGTTCA TAATGCTACG ATAAGCAATG GCACCATCTT GCCAATCAAC 180 

TTTCTTGTCT GCATTGGCAT CTTCAGTGAT AACAACCTTA GCACTTGGAA GTTCCTTCGT 240 

GTATTCTGGG AAAACAATGC CCTTATAAGC TTTTTCCCAT TGCCATTCAG AGCTGTGGAT 300 

TCCTACATAG TTGGCATTTC CGACTGTTTC TTTATAAGCT GTCAAACGAG TCCAGTCATT 360 

CGAACCACCA CCATAGCTAT TTTGAGAGTT ACTCCAAACA CCAGCAGCAA GCTTATCTGT 420 

AGAAACAAAT CCATACATGT AACCCTTAGC CAAATCCTTC ATTGGATTGG TTACATCGAT 480 

ATGATCATCT CCGCTGACAT GCGTATTGTT TGACATGGTT GCCCCATCAA ACTTAGCACC 540 

AGTTTGATCA CTAGAAACAG AGACTAAAGC ATTGCCGAGG AAACTAATAG AAGAAAGTAG 600 

TTTTCTTTCG TCATCAATCT TTTGACCTGG AGTGACTTGA TTGTGGTTGA CAATCTTGGT 660 

CACATCAAAG TGCAATTGAT TGTCCACAAC TTGCAAGCGT ACTGTCATTT CCGCATTGAT 720 

TAAGTGAGCA TCATCGCGAA GCTTCATCAA GTACTCTGCT GTTGTCTCAT TGATTTTTTT 780 

ATAAGTGACT TCAGGGGTGA TTCGGTGGTT ATTGATAAAG ACTTGGTTGA ATTGTTGCAC 840 

CTGTCCTGGC AAAGTATGTC CATTCAAGGT GTATCCCTTG ACACGAAGGA AGGCTTGGTC 900 

AATTACTGCC TTAAGTACCT TAAACTGGAT CGTATCATAA GTCACCTTGC TATCGTCAAC 960 

AACCGGACCT GTTTCTTTCT GGGCAGGGGT ATCCTCTGGG TTTTACCCTC TCTGTGGCTA 1020 

TCCGTTTCAA CGCTTGAACA ACTGGTCGCT CATCGTCATA AGAGCCCGCC TTGAGAAAAA 1080 

TCTTCTTCTC ATTTCTAAGA TGGTCATTGA CCGCAGCTGG TAGAGTCACT GTGTCAAAGA 1140 

AGATTGACAT CCTTATTTGC CTGGCATTTA CCTGACCGTC TGACTTGAAG ACTGATAGAG 1200 

AGACGGTTTG TTGATCCTGT TTCAGGAGCA GCAACACGAC TACCTCTATA CCAAGTGCTA 1260 

GTTGTTGGAG ATTTATACTC CCAGAACCAG CCATCCTTGT CATAACCGAC AAAAACATTA 1320 

TTATTGGTAT CTTTAAATTT CAAGGAGACA CCAAAGCGTG ATTTGCCCTT TTCAGAATCT 1380 

TCTTTGAAGG TTAAATCAAC AGTTGCATTT CCAT? "SCAT CAACGGTCAA GCCCTTCTTT 1440 

TCAAACAGAG , 

1450 

(2) INFORMATION FOR SEQ ID NO: 144: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1474 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

Ui) MOLECULE TYPE: Genomic DNA 



(xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 


144: 






CTGTATTGCG 


CCATGAAGGC 


AATCAAAAGT GGTATGCCGT 


CTTGNTGAAA ATCTCTTGGA 


60 


ATAAGCTGGA 


AAAGGGCAGA 


GAAGGACAAG 


TGGAAGCAGT 


CAACCTCAAG 


CATGACCAAG 


120 


TAGCTAATTT 


GCTTTCACAA 


AAGGGGATTT 


ATCCAGCCTT 


CCATATGAGC 


AAGCGCTACT 


180 


GGATTAGTGT 


GTCCCTTGAT 


GATACTTTAT 


CAGATGAANA 


AGTACTGGAA 


TTGATAGAAA 


240 


AAAGTTGGAA 


CTTAACCTCT 


AAAAAATGAA 


ATATTTTAAT 


AATTTTCATG 


AACTTTCAAT 


300 


TAGCTAAATA TTCTTTACTG 


AAGAAATTTT 


CAGAAAATAT 


AGGATTTACC 


ACACCAGAGG 


360 


AATGTGGTGC 


CATCTTCAAA 


TACCTGATTG 


AGAATGTCAA 


GACGGATCGT 


CAGATCATCT 


420 


ATTCGCCTCA 


CTGTCATGAT 


GAACTCCGAA 


TGGCAGTGGC 


AAATAGCCTT 


GCTGCTGTCA 


480 


AGAATGGTGC 


AGGACGTGTT 


GAAGAGACTA 


TCAATGGTAT 


TAGGGAGCGA 


GCTGAAAATG 


540 


CTGCTTTGGA 


AGAAATTGCA 


GTGGCTCTCA 


ATATTCGCCA 


AGATTACTAC 


CAAGTAGAAA 


600 


CCAGTATTGT 


CCTAAATGAG 


ACCATCAATA 


CGTCAGAAAT 


GGTTTCTCGC 


TTCTCTGGTA 


660 


TTCCAGTTCC 


TAAAAACAAA 


GCCGTCGTTG 


GTGGCAATAC 


CTTCTCCCAC 


GAATCTGGTA 


720 


TTCACCAAGA 


TGGAGTCCTT 


AAAAATCCTC 


TCACTTATGA 


GATCATCACA 


CCTGAATTGG 


780 


TTGGTGTTAA 


GATTCCGCTT 


GGAAAATTAT 


CTGGTCGCCA 


TGCTTTTGTT 


GAGAAACTGA 


840 


GAGAATTGGC 


CCTAGATTTT 


ACAGAAGAGG 


ATATCAAACC 


ACTCTTTGCT 


AAGTTCAAGG 


900 


CACTGGCCGA 


TAAGAAGTAA 


GAAATCACAG 


ATGCAGATAC 


TCGAGCTTTG 


GTAGCTGGAA 


960 


CCATGGTTGA 


AAATCCAGAA 


GGCTTCCACT 


TTGATGATTT 


ACAACTTCAA ACTCATGCAG 


1020 


ATAATGACAT 


TGAAGCGCTC 


GTTAGCCTAG 


CCAATATGGA 


TGGTGAGAAA 


GTCGAATTTA 


1080 


ATGCGACAGG 


GCAAGGTTCC 


GTTGAAGCAA 


TCTTTAACGC 


TATCGATAAG 


TTCTTTAACC 


1140 


AATCTGTTCG 


TTTGGTGTCC 


TACACTATCA ATGCGGTAAC 


AGATGGAATC 


GATGCCCAGG 


1200 


ATCGGGTTTT 


GGTCACTGTT 


GAAAACAGAG 


ATACAGAAAC 


CATCTTTAAT 


GCAGCAGGGC 


1260 


TTGATTTTGA 


TGTGTTGAAG 


GCTTCTGCTA 


TTGCCTATAT 


AAACGCTAAT 


ACCTTTGTTC 


1320 


AAAAAGAGAA 


TGCAGGTGAG 


ATGGGACGC A . 


GTGTTTCTTA 


CCACGATATG 


CCTAGTGTGT 


1380 


AAAGGAGAAG 


GCTATGGCAA 


AGAAAATAGT AGCTCCTAAT TACGAGACCG AATTGGCCCA 


1440 


CAAATCATGG AGGCTGGTTT . 


AGAAGTTCTG 


GAAG 






1474 



(2) INFORMATION FOR SEQ ID NO: 145: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1455 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 145: 

CCATACGACC TGTTTCTTTC GGTAGAGCAA CTCCCAGTTG ACCAGTCAAG CCGACCGCTT 
TGGCAATGTA NTCCATATTT TGTTGGATTT GCATGANCAA ATATTGCAAT GGATACAAGT 
TATCACTCTT GATGTAAAGA AGGGCGTTGA ACCAGTCATT CCAGAAACCA AGAGCTGTCA 
AAAGCGTGAT GGTTGCGATA CCTGGTAGTG ACAATGGCAA ACAGATTTGG AAGAAAATCC 
GAGCCTCACT GGCACCATCG ATACGAGCCG ATTCGAGAAT GGCTTCTGGA ATGGTCTTCT 
TGAAAAAAGA ACGCATCAAG ATGATATTGA ATGGTGAGAG AAGCATTGGA ACAATCGAAG 
GCCCAAACAG TGTCACCAAG CTGAAGTACA CGGGTCACCA TGATATAACC TGGTACCAAA 
CCAGCGTTGA ACAACATACT GAGAAGGACG AAGATGGTAA AAAATCTGCG ATACTTAAAG 
GTTGTCCGTG AAATAGCGTA GGCATAGGTT GTTGTGATAA AGACATTTGT CAATGTCCCA 
ACTACGGTTA CAAAGACAGA GATGAAGAGG GCTTGTAGGA TTTTATCCTT AAACTGTGCC 
AAAAACTCAA AACCGTCTAA GCCAAATTGG GATGGGAAGA AGCTATAGCC GTATTGGAGG 
AGGCTTTTCT CGTCTGTCAC TGAAATAATG ATAACGAATA CAAAAGGTAG GATACAAGAG 
AGGGCAATCA AACCCGAAAT GATACTGAAG AAGATATCTG CTTTCTTACT GAAGGAGTGA 
ATGCCGACAT TATCAATTTT TTCTTTTTTA ATTTTCTTTT TTGCCATATT CTCCTCCTTT 
CTAGAACAAA GCTGAGTTTG GATCGACTCG TCTTGCAAGC AAGTTTGATA GGATAACCAG 
AATCAAACCA ACAACGGATT GGTAAAGACC GGCTGCTGCA GCCATACCGA TATCTGCTGT 
CTGAGTCAAA CCATTAAAGA CATATACGTC CAAAACGTTG GTTACATTGT AAAGCTGACC 
AGCATTGTGT GGGATTTGAT AGAAGAGACC GAAGTCTGCG CGGAAGATAT TTCCGACTGC 
AAGGATGGTC AATACAGTTA CAAGCGGAGT CAACTGAGGA ATGGTTACGT TGCGAATACG 
TTGCCACTTG CTAGCTCCGT CCACTGTCGC TGCTTCGTAG TAGGTTGGAT CAATTCCCAT 
GATCGTCGCA TAGTACATGA CACTGCTATA TCCAAAGCCT TTCCAAATAC CTAGGAAAAG 
TAGGAGATAG GGCCAGATGC CCAGGTCAGC GTAGAAATTG ACTTCTTTGA GACCAAGACT 
TTCCAATAGA TGATTGAACA CCCCTTTATC AATATTTAGG AAGGCATCTG TAAAGAAACT 
AATGATAACC CAAGACAAGA AGTAAGGGAA CAACATAGAA GTTTGAAAAA TCTTCACCAT 1440 
TCTCTTAGAA CGGAG 

1455 

(2) INFORMATION FOR SEQ ID NO: 146: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 769 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 146: 

CTAAAAAGAC CAACTTGGCC TTGCCGTCTT GAATGGCCTT GACCACCAAT TCTTCACCCG 60 
ATATGATGCG CCCTGCTCGC TGAGCAAGCC CCAAGAGATT ACTTATCTTT TGCTTATTCA 120 
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60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
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AGTCCCAACT CTCTTCTTTT CACTTTGTGA TCCACATAAG CGATCAACTC GTCATAAAAG 180 
CTTTCTTCCA CTTCCATGCT AAAGCTGCGG TTAAAGACCT TCTTCTTTTT CGCCTCTAGG 240 
GCTTCTGCAT TGTCTAGTTT GATATAAGCG CCGCGGCCAT TGGCCTTGCC CGTAGGATCA 300 
ATAAAGACTT GTCCTTCCTT GTTCTTGACA ATGCGGAGCA AATCACGCTT ATCAATCACT 360 
TCGTTAGACA CAACAGACTT GCGCAAAGGG ATTTTTCTTG TTTTCATCTT TCCCTCCTCT 420 
AGCAGCTTTT ATTCTTCTAC AGTATCGTTT TCTACTTCCA ACTCTACTGA AGCAGCGTCT 480 
TCCATGGCTT CAAATTCGCT ATAGCAGACT TGATATCGAT ACGGTAACCA GTCAAGTGAG 540 
CCGCCAAGCG CACGTTTTGT CCACGACGAC CAATGGCAAG AGAAAGCTTG TTATCTGGAA 
CAACCACCAA GGCACGTTTG CTGTCGTTTT CATCAAAGAT AACTTGGTCA ACCCCAGCCA 
GGAGCGATGG GCATGGTAGC ATAAATTCAG CTGGATCTGC CACCCACCTC CAAAACATCG 
ATATTTTCTT CGATTGGTAC CATGCGGTCA TTTTTAGCAT CGTAACGAG 



600 
660 
720 
769 



(2) INFORMATION FOR SEQ ID NO: 147: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 834 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 147: 

TTCGCAAGGA CAAGGACTTC ACCGGAGGAC AATTTTTGGA GAGACCAGCA GCTGGTAATT 60 

CGAACACCTT CTACCTCGAA TCCCTGTAGT TGACATTTTT TCAGCCAACT CTTGTGATGG 120 

CACATCAATG TCCACCAATT CTTTTAACCA TTTATAAGAT ACAAGCATAA TTTAGTTCTC 180 

CAGAATGACA GTTGTCACTC TAGTTCTTTT CCTTTCCTAT CATTTCAATA GAAGAATCCT 240 

CTTCTTACCT TAATTTCTTT CTCAGTAACC AATCCGTATC TACTTTTTGA CCAACCATAA 300 

AATGATGTTG GCTAAATTTT TCAAAACCAT ATCGGTTATA AAACGCTTGA GCTTTTGTAT 360 

TATGCTCCCA AACACCTAGC CAAGCCCAAG AAAAACTATT TTTTGTAGCA AGTTCAAGAG 420 

CAAATTCAAA CAGTTGCTTA CCTAGTCCAA ATCCTTGGAA TTTTTGTAGC ACATAGAGGC 480 

GTTGAATTTC AAAAGCGTCC TCTAATTCTC TCTCAGTTTG AGCACTTCCC CAGTTGACTT 540 

TGAGAAAAGC AGCTATCTCC TCCTCCTGCA TAATGAAATA GGTTTCAGAG TCAGGATTTC 600 

CCAACTCAGT TGACAAAGTT TTCAGACTAT AAGCCTCTTC AAAGTATTCC TGTAACTGCT 660 

CTTCCGTATT ATCATACGCA AAGGTTTCAC GAAAGGTTTG TTTGGCAATT TTAGCCAACA 720 

CCTCAACATC TGCCATTTCT ACTTTTCTAA TCATTATTTA AACTGTTCTG AGAAGCGGAC 780 
ATCTCCTTGG TAGAATCCAC GGATATCGTT GATTCCATAA CGGAGCATAG ATCG 



834 



(2) INFORMATION FOR SEQ ID NO: 148 : 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 856 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 148: 



CTCGTAAATG GAATGTGGTC AGTAATTTTG GTAAATTTGC AGATCCTATG GCGGATAAGT 60 

TACTAGTTAT GTCGGCTTTT ATTATGTTGA TTGAGTTAGG TATGGCTCCG GCTTGGATTG 120 

TTGCAGTGAT TATCTGTCGT GAGTTAGCTG TGACAGGTTT AAGGCTTTTA TTGGTTGAAA 180 

CTGGTGGAAC AATTTTAGCA GCAGCAATGC CTGGAAAAAT TAAAACTTTT AGTCAGATGT 240 

TTGCCATTAT TTTCTTGCTA TTACATTGGA CTTTGCTTGG TCAAGTTCTA CTTTATGTAG 300 

CCTTATTTTT CACTATCTAC TCTGGCTATG ACTATTTCAA GGGTAGTGCC TATGTATTTA 360 

AAGGGACATT TGGTTCGAAA TGAAATCAAT AATTGATGTA AAAAATCTTT CTTTTCGCTA 420 

TAAAGAAAAT CAGAACTACT ACGATGTGAA GGATATTACG TTTCACGTGA AACGTGGAGA 480 

ATGGCTTTCG ATTGTAGGGC ATAATGGTAG TGGTAAATCA ACGACGGTTC GATTAATTGA 540 

TGGCTTACTG GAAGCAGAAT CCGGAGAGAT TGTAATTGAT GGCCAACGTT TGACTGAGGA 600 

AAATGTTTGG AATATACGTC GTCAAATCGG TATGGTTTTT CAAAATCCAG ACAATCAATT 660 

TGTTGGAGCG ACTGTTGAAG ATGATGTTGC TTTTGGTTTG GAAAATCAGG GACTTTCTCG 720 

TCAAGAAATG AAAAAGAGAG TGGAAGAAGC TCTGGCTTTA GTTGGCATGT TGGACTTTAA 780 

AAAGAGAGAA CCAGCGCGTC CTATCAGGTG CCAAAAGCAC GTGTGGCCAT TGCAGGTGTT 840 

GTAGCCCTAA GACCAG 856 

(2) INFORMATION FOR SEQ ID NO: 149: 



( i ) - SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1037 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 149 : 

TCCCCATCCC AGTGGGCAGC AGCATTAGTC TGTACTTCTG CGTCGTTGCG TTTTATTAGA 60 

AGACTGATTT TGCTTTGAAC TACTTGAAGC TGTCGTTGTA TCCTGCGAAG TTTTTGCTAA 120 

CTTCAGTTTC ATTTGCAGCG ACATGACTAG TTGCCAAGCC TAGTAAACAG ATACTTGCTA 180 

ATCCAATTTT TGTTTTCAAT CTTTCCTCTC CTATAAAAAA TGTAACAGAC ATCTGAATGC 240 

TGTTCCACCT AGCTTTTGCT ACTTACTGAT TATTTTACAA AGTCAAGCAA AGCCAAGAAG 300 

CTTTCAGCTT CAAGTGACGC ACCACCTACA AGGGCACCGT CAACGTCTGG GCAAGCCATG 360 

TATGAAGCAA CATTTTCAGG TTTAACAGAA CCACCGTATT GAACACGAAC TTTGTCTGCG 420 
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ACTTCTTGAC CAAAGTCAGC AGCTACAACG 
TCGTCTTGTG AAGCTGATTT ACCAGTACCG 
GAGGCAGCAA CTTGTTCAGC AGTCAATCCA 
AATTCAGCAG CTTTACCAGC TTCGTAAGTT 
AGCATACCGT TCGCAAAGAT TGCTTTTGCT 
AAGTAGTCAC GGCGTTCTGA GTGACCGATA 
ACTTGTGGGC TAGTTTCACC AGTGAAAGCG 
AGCAACGTTA AAGTTTGAGC CTTNGGCAAC 
AGCAGNGATA CCTGCTTCAA CAAGATCTGA 
GAATGNTCCA GCTTNTTGGG GATTTTTGTT 
ACGTGACATT TCACATA 

(2) INFORMATION FOR SEQ 
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TCACGAACAA 


CTTTACACAT 


TTTTTGTGCA 


dflO 
*mO\J 


ATAGCCCAGA 


TTGGCTCATA 


AGCGATAACT 




GCCAATGCAG 


CAGATACTTG 


AGCACCTACG 


son 


TCAAGTGATT 


CACCACAACA 


GATGATTGGA 




m-rixriTGA 


CATCTTCGTC 


AGTATCATGG 


720 


ACAACGTAGT 


CAGTACCGAT 


TTCTTTfAAA 


•jon 
/oU 


ACCTGCATTT 


TCAAAGTAGC 


AGTATTGAGC 


840 


AGCAAGNACA 


GTTGTCAAAT 


CAAGAGCTGG 


900 


TGAAGGAAGT 


TTTGATGCAA 


CTGCTTCAAC 


960 


CATTTTCCAG 


TTACCAGCGA 


TAAATGGTTT 


1020 
1037 



ID NO: 150: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 998 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 150: 

GATTCTACAG GATCCATTTT ACTATCTTAC GCGCCGGGAA GTAGGCTGAG ACATAACCAA 60 

GTAATAGAGC CAACAACTAG AGTTCCTAAA ACAGATAAAA GATTTAATTC AAAAACCTTA 120 

GTGATGGATG GGTAAAAGTG ACTTACAATC GCATTCGCCA AACTTCCCAC CCCTTGTGCA 180 

ACCAAAAATG CCAGCAGCAA GGCGATGCCT ACAATCCAGA TAGCCTCGTA AATAAAAATT 240 

CCTTTGACAT CACGATTCTG ATAACCAACT GCTTTCATGA CACCTATTTC CTTGGAACGT 300 

TGCATGATAT TGATGTAAAT AATGATACCA ATCATAACCG CTGCTACCAC AATAGCTTGT 360 

GATGAAAGCA CAATCAATAA TCCCTGAATA ACACGAATAA AGGTAATCAC AATATCAAGA 420 

ACTCTCTGTT AAGAAAGCAC AGTATACTTC TTATTTTTCT GTAATTCTTC TGTTACTACT 480 

TTTGTCTGTG ATGGATCTTT GAGTTCCAAG ATAAAATAAG ATACAGCTTT CGTAAATCCA 540 

GCCTCTTTCA AAATCGTTTC CATTTGATGA GACAGCATGA AACTGTTGCT GTCCTCCATG 600 

TCATCTTCAT CATTGATTAC ACGTACAATC TTCGTTTGAA ATTGAGCAAT CTTACTAGTT 660 

TC ftGCAGCA C TTTCTACAAT GCTGACTGAG ACTGATTTGC CAATAAGATC ATTAGCTGTC 720 

AAATTTTTTC CTGTCTGTTC ATTCCAATTT TTTAGTAAAC TGCTTGGAAT CGTTAATCCC 780 

TGTTCATTTG TATCAGTATA GAGGGATCCA GCCAACACTT TGTCCGTCTC ATTATTACTA 840 

ACAGAGATAC TTGTATCATC ATAAAGACTC ACTACTTGAG CATAAGAAGC ATCGTTTGAC 900 

TCAAATCCAT TTCTTGCCCA TCTTTTCTTG CCCATCTATA GTAATATTTG ACATGTTCAT 960 

CCCAAAAGGA CTCTCCAAAT ATTTAATAGA TCGAGCCT 998 



(2) INFORMATION FOR SEQ ID NO: 151: 
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(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1261 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic UNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 151: 

CACATGTCGA CTAAGCCGTT AGAAATCAAG TGTATAGGCA GTTCATCATC TAATAGAGCG 60 

TTGCGTCCGA AAGTCTATCC AGACACGGCT CTTTAAAAAC AAAAGGAGAA ATGATGCATA 120 

CTTATTTGCA AAAGAAAATT GAAAATATCA AAACAACCCT AGGTGAAATG TCAGGTGGTT 180 

ACCGTCGTAT GGTTGCGGCT ATGGCTGATT TAGGATTTTC AGGAACTATG AAGGCTATCT 240 

GGGATGACCT CTTTGCCCAT CGTAGTTTTG CCCAGTGGAT TTATTTGCTG GTTTCAGGAA 300 

GTTTCCCTCT CTGGCTGGAG TTGGTTTACG AACATCGTAT TGTTGACTGG ATTGGGATGA 360 

TTTGTAGCTT GACAGGGATT ATCTGTGTAA TCTTTGTATC GGAAGGTCGA GCAAGTAATT 420 

ATCTTTTTGG CTTGATTAAC TCTGTTATTT ACCTTATTTT GGCCCTACAG AAAGGCTTTT 480 

ATGGTGAGGT GCTGACGACA CTTTACTTCA CAGTCATGCA GCCAATTGGA CTCTAGTTTG 540 

GATTTATCAG GCACAGTTTA AGAAAGGAAA AGCAGGAGTT TGTCGCGCGT AAACTGGACG 600 

GCAAGGGCTG GACAAAGTAT CTTTCCATTA GTGTGCTTTG GTGGTTGGCC TTTGGCTTCA 660 

TTTATCAAGT CTATTGGTGC CAATCGTTCC CTTATCCGTG ATTCAATCAC CAGATGCAAC 720 

CAATGGGGTA GGGCAAATCC CTCATGACAA GTCTGTTTAC CGTGAACAGT GGATATTCTG 780 

GGCGGCTACC AATGTCTTTT CAATCTATCT CTGGTGGGGA GAAAGCCTGC AAATTCAAGG 840 

GAAATATCTA ATTTATCTCA TTAACAGTCT AGTTGGTTGG TATCAATGGA GCAAGGCAGC 900 

TAAGCAAAAT ACTGATTTAC TTAACTAGGA AAAGATGTTT GAAAGTGCTG TTTTGAGATT 960 

TCGATTAAAA CAGATATAGT TGATAATCAA GGATTTATAG TATGAAAAAG AGGATCGGCG 1020 

GGTCCTCTTT TGTTGTTGAA AAGATAAAAA ACTCAGTAAC CTAGAAATAA GACAACTGAA 1080 

GCTTTACTCT ATATTCAATT TTTAGGAATG AGAAGGTCTA GATAAAATTG GACAACTTCC 1140 

TGGTCTGTGA AATCTTGACC TTTTTTGAGC CACCAGGTCA ATGTCTCGAT AAAGTTGGAC 1200 

ACGACCAAGT GTTGGAGGTA AGAAGTAGGC AGATTAGGGT GGGCTTCTTT TAAATTATCA 1260 

1261 

(2) INFORMATION FOR SEQ ID NO: 152: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 442 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



G 



(ii) MOLECULE TYPE: Genomic DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 152: 



CTGATCATCT AACACAGAAG GGAACGTCTC ATCTATTCTT TGATCTGGTG GTAGGTAACT 60 
GTTTCTTTGC TTTAGGATGT GGTTAGGGAG GTGAAGTCAG ATAAGTAGAT GGGCTTAATT 12 0 
TGCCCAAGTC CCACATCAAA GAATTATCGG CGAATCCAAG GTGCTCCTTT AATTATTTGT 180 
AGATCTTCAG GAGCAAGGGA GACAACTTGC ACGTGTCCAA CGTAACCTGA AAGGCGTGCC 
TGTTTTAAGT GATTAACTTT TGTCACGATG CCTTTTTGTG TTAGTTCGAT TTCTAGGGGA 
ATGGATCCAG TTTTTTTCTG AACGAGACCT GAACCATGAA ACTTGTGCCT CATCATCAAT 
GAATGAGATT CTTTATCTGT TCGAGTTCGA TCGACTACAG CGTCAAGGCT AACAAATAGA 
TAGCCTCTAT CATGTCTGTT TT 



240 
300 
360 
420 
442 



(2) INFORMATION FOR SEQ ID NO: 153 ; 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1282 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 153: 

GAGCTGATGA ACGCTCCAAA GAATTTCGCA AATCATCAAA ACGGTACTTA ATCCCCCCAA 60 
CCAAATCGGC ATGACCTGGG CGAGGATGAG TAATTTTCCG CTTGCTTTTA AGGCGGTCTT 120 
CAATGTCCTC CGCAGACATG ATGTCCAGCC ATTTCTGGTG GTCCTTATTG ATGACATACA 180 
TAGTAATAGG CGCCCCTGTC GTCTTCCCGT GGCGAACGCC CGAAGTAAAG ACAACCTGGT 
CATTCTCAAT CTTCATACGA CCACCACGAC CGTAGCCACC CTGACGGCGT CTAAGGTCCT 
CATTGATATC CTCAGCTGTC AATGGAAGTC CAGCTGGAAT TCCCTCAATA ATAGCTGTTA JOU 
GACGGGGACC GTGTGATTCT CCTGCAGTTA AATATCTCAT ACACTCTCCT TATTTTACCA 420 
AGTAGTCTTT CATCTCTTCC AGAGAAACTG GATGAATGGT CGCTGAACCA AGCTCTGGCA 480 
CCAAGACCAA TTTCAAGGTG TTGCCACGCG CTTTCTTGTC ATGAGTAAGA GCCTGATAAA 540 
GCTTGTCAAC TTCCCAATTT TCATAGTCAA CAGGCAAGCC GAATTTCTGA CACATCTCTG 600 
TGATAGATTG GGTAATGCCA GCTGGCATGA GGCCTTTTTC CTCAGCAATC TTGGAAATCT 660 
GTACCATTCC CATGGCAACA GCCTCTCCAT GCATGACCTT GCCATAACCG GCAGTCGCTT 720 
CGATGGCATG GCCAATAGTG TGGCCAAAAT TGAGGTAAAG ACGAATACCA TTGTCCAACT 
CATCTTCAAC CACCATCTTG CGCTTCACCT GACAAGAATG TTCAATCAAG GTCTCTGCAT 
GTTCCAAAAT ACTCTCAACA GAACCATTCA GTCCCGTCAA GAGAGCCCAC AGTTCTGGAT 
CCTCAATCAA GCCATACTTG ATAACTTCAC CCATCCCTTC AATCAACTCT CTTTTTCCGA 
GGGTTTCAAG AACAAGTGGA TCAATCAAAA CCCCATCTGG TTGGGCAAAG GTCCCCACCA 
TATTTTTAGC AAATGGTGTG TTAACACCTG TCTTTCCACC GATAGAAGAA TCAACCTGAG 
CTGTCAAACT AGTCGGAATC TGAACAAAGT GAATACCCCG CATATAGGTA GAGGCTACAA 
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780 
840 
900 
960 
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ATCCCAGCCA GGTCCCAACA ACGCCACCAC CAAGAGCAAC GATTCCATCC GCTACGANTC 1200 
NGAACTTGCT TGACTAGAAA TTCATACACT TTCTGAACCG TAGTTAAATT CTTTCTTTCT 1260 
TCACCTTCTA AGAAGTCCAA AA 

1282 

(2) INFORMATION FOR SEQ ID NO: 154: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 769 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 154: 

CTCACGCAGT TCTGTTACCA AAGACCGTTC AAAAAGCAAA CTAGCAATAC TTATAACTTA 60 

TTAGAAATTC CTAATTTTAC CTTGCTAAAC GAATACAGGT CACGATGCCA TCTGAGTCTG 120 

TGATTTCCAA TTGGCTCGAT GTCATCCATT TGATTGGTAC GTCAACTTCT TGTGCTCGTT 180 

GGGCAATCGT TAACAGTTCT TCTTTATGTG CAACTTCGAT GACATAGTAG GCTAAACCTG 240 

GTAAGCCTTG CTTACGCGGA AGCCAGACCT TTTCCTTCCC CATTCGTTAA CTGCTAAATG 300 

ATGATGGTAA TCTCCAGCCG CAATCCAACT AGCGCCAGGC ACACTGAATT TATTCTCTAG 360 

CCCTAACACC TTTTGATAAA ACTGGCTGGA CTTTCGACTA TCCTTGACGG AAAGATGAAT 420 

ATGCCCCATT CTTGTACCTT CTGCCAGGAT AAAGGGCTCT ACTCTTTCCC CCAACTCATA 480 

AATGTCCTGC GCCGCAAGAG CCTCCGTCAC TCCGATAATG CGTCCATCTT CTCGAATATC 540 

CCATGTGGAA ACTGGCTTAT CTCGATAGAG TTCAATGCCA TTTCCCTCCA AATCTTCCAA 600 

GTAAATAGCT TCACTGTAAC CATGGTCTGC ACCGCCGACA AGAGGAATTT GCAAATCTGT 660 

CAGATGTTTC AAGACATCAG CCAAAGCCTT GCGTGTGGGC AAGAGAATGG CCAAATGGTA 720 
AAGACCATAA TGTTCCCTCA CTTCTCCGCT CTTCTTGTGC TTGAATCAG 



(2) INFORMATION FOR SEQ ID NO: 155: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 686 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 155: 



769 



CTGGAGATTA TTGAGATTGC CCGTCAAAAC GATTTGATTA TNTTTGCGGA TGAAATCTAT 
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GACCGCATGG TAAATGGACG GACATGTGCA TAACGCCTGT GGCGAGCTTG GCACCAGATG 120 

TCTTCTGTGT CAGCATGAAT GGTCTGTCAA AATCCCACCG CATAGCAGGT TTCCGTGTGG 180 

GTTGGATGGT CTTGTCTGGC CCTAAGACTC ATGTTAAGGG CTATATCGAA GGGCTCAATA 240 

TGCTGTCCAA TATGCGCCTT TGCTCTAACG TTTTGGCCCA ACAAGTCGTA CAAACTTCCT 300 

TGGGGGGTCA CCAATCAGTC GATGAATTGC TTCTTTCCTG GTGGACGAAT CTACGAGCAA 360 

AGAAATTTCA TCTATAATGC CATTCAAGAT ATTCCAGGTT TGTCTGCCGT TAAACCCAAG 420 

GCGGGGCTCT ATATCTTCCC AAAAATCGAC CGCAATATGT ACCGTATCGA TGATGATGAG 480 

CAGTTTGTCC TTGATTTCTT GAAGCAGGAA AAGGTTCTCT TGGTTCATGG TCGAGGCTTT 540 

AACTGGCAGG AACCAGACCA CTTCCGTATC GTTTACCTTC CTCGTGTTGA TGAGTTAGCC 600 
CAAATCCAAG AAAAGATGAC TCGTTTCTTG AAACAGTATC GTAGATAGGG CTTGCATTCG 
AAAAAGCTGG AAACATTTGC CTAGAG 



660 
686 



<2) INFORMATION FOR SEQ ID NO: 156 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 525 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 156: 



CTAAAAAAAG TCAAGTAGAA AACGAATATT CTACTTAACT TGCACGAAAT TATTTTTCAC 60 

GAATGACTTC GACCTTATAT CCATCAGGGT. CTTTGACAAA GTAATAGTTT GGTGCAGTTC 120 

CTGGTAGACC ATTTGGCTCA GTCACTTCAT AGCCTTTTGT ACTGTGCTCT TGATGAAGTG 180 

CCTCAAGATC AGGTGTACTG AGGGCGATAT GGGCAAACCC ATCACCAACC ACATACGGAC 240 

CGTGATCGTA GTTATAAGTC AACTCCAACT CATAGTCATC ACCCTCAAGA CCTAGATAGA 300 

CAATCGTGAA GGCATGGTCT GGAAAATCTC TGCGACGCAA TTCTTTAAAA CCAAAAGCAT 360 

CTTGATAAAA TGCAATTGAT TTTTCAAGAT TTTCTACTCG TAAGCAAGTG TGTAGCATTT 420 

TTGAAGCCAT ATCTTTCTCC TTTATTTTTA AAAAGACTGG ACAATCCTGT TCCAGTCTCA 480 

TCAGTTGTTA TTTACCAAGT TTTGCTTTAG CTGCATCTGC AAGAG 525 

(2) INFORMATION FOR SEQ ID NO: 157: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 575 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
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60 
120 
180 
240 
300 
360 
420 
480 
540 
575 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 157: 

CTAGCTTTGT GTCCTCTCTT CTCTACGGAG TGGCGGTTTT GATTATCGCC TGTCCTTGTG 
CCTTGGGACT TGCAACACCG ACAGCCCTTA TGGTGGGGAC AGGACGTAGT GCCAAGATGG 
GGGTTCTCCT CAAAAATGGA ACTGTCTTAC AGGAAATCCA GAAAGTTCAA ACTCTTGTCT 
TTGATAAGAC CGGGACTTTG ACGGAAGGGA AACCTGTGGT AACAGATATC ATCGGCGACG 
AAGTAGAAGT GTTTGGGTTG GCAGCCTCCT TGGAAGATGC TTCTCAACAC CCACTGGCTG 
AGGCTATCGT TAAGCGAGCG AGTGAAGCTG GACTTGAGTT TCAAACTGTT GAAAATTTTC 
AGACCTTGCA CGGGAAAGGT GTTTCCAGGG CGAATTAATG GAAAACAAGT TTTACTTCGA 
AATGCTAAAA TGCTGGATGG CATGGATATT TCTAATACTT ATCAAGATAA ACTAGAAGAA 
CTAGAAAAAG AAGCTAAGAC AGTTGTGTTT CTTAGCTGTT GACAATGAAA TCAAAGTTTT 
GCCTTGCTTT GCCAGATATC CCTAAAGAAA TGCTG 

(2) INFORMATION FOR SEQ ID NO: 158: 

U) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1003 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

j 

(ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 158: 

CTCAATAGCA CCAATAGACA GTTTCTTGGT TACATCATAA GCACCAGGTG TTACCAAAGC 
TTCAACAGCG TGAGTCAGAG CATCCATACC AGTAGCAGCA GTCAGGCCTT TAGGTTTGGA 
AAGCATGAGT TCCGGGTCAT TAACAGAGAT AAGAGCAAGG CTATTCTTGT CAACCATTAC iOV 
CATCTTAACC TTGCGTTCTT CATCAGTAAT CACATAGTTA ATAGTGATTT CTGCAGAAGT 240 
TCCTGCCGTT GTATTGATAG CCACAACTGG CAAGCCTTTT TTAGCAGACT TATGAAGCCC 300 
TTCATAATCC TGTGGTTTTC CACCATTTGT AGCCATGATA GAAATACAAC TAGCTGCATC 360 
CTGTGGAGAA CCCCCGCCAA GACTGATGAT AAAGTCACAG CCATGTTCTT GCAAAGCAGC 
TACCCCATCT GTGACATTTT TACAAGTAGG GTTTGGCGTC TACATCGCTA AAGATGACAT 
ATTCGATTCC TTCTGTATCC AGTGGTTTTA AAACCTTAGG TAAAATATCA CTGCCTTCGA 
TGTACTTATC TGTCACCAAA AGTGCCTTTT TATAGCCAAG TTCCTTGATA TAAGGACCCA 
CTTCATTTAC AACACCTTTT CCAATAAGGT TGACTGCCGG AACATAAAAT GTAGCCATAT wov 
AATTTTCCTC CTGCGCCTCC GCGCTATCTA TTTAACATCA TTATATTACT TTTTGTATTC 720 
GTTTTCAATT ATTTGTGAAT TGTTTAACAA ATCTTTTTAA AAGGATAGGA GGGACGTAAG 
TTCACCTCCT ATATAAACAA GGTTTATTTA TGTAGTGGTC CCAATAACTG ACATGCACGA 
TAGTCTGCTG ATTCTAGATC TTCTGTTCCA AATAAGGACC AATTTTTAGG TCTAAAGATA 
TCTTCTTCGG GTACATTATG CATATTGACA GGAATTCTCA ACATAGAAGC CAAGGTAATC 
AAGTCTGCTC CAATGTGTCC ATATGTTATG GCTCCGTGAT TAG 

200 



60 
120 
180 



420 
480 
540 
600 
660 



780 
840 
900 
960 
1003 



WO 97/43303 PCT/US97/07950 



(2) INFORMATION FOR SEQ ID NO: 159: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 715 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 159: 

CTAAAGTTTC TGATGAACTT GGTGGTGGAT CAATTACAGC CCTACCATTT ATCGAGACAC 60 

AAGCAGGAGA TATCTCAGCC TATATCGCAA CCAACGTGAT TTCTATCACT GATGGACAAA 120 

TCTTCCTTGG CGATGGCCTC TTCAATGCAG GTATTCGTCC AGCCATCGAT GCGGGTTCAT 180 

CTGTATCTCG TGTAGGTGGT TCTGCACAAA TCAAAGCCAT GAAGAAGGTT GCTGGTACAC 240 

TTCGTATCGA CCTTGCTTCA TACCGTGAGT TGGAAGCCTT TACTAAGTTT GGTTCTGACT 300 

TGGACGCAGC AACACAGGCT AAGTTGAACC GTGGACGTCG TACCGTTGAG GTCTTGAAAC 360 

AACCTGTTCA CAAACCATTA CCTGTTGAGA AACAAGTAAC CATTCTTTAT GCTTTGACAC 420 

ATGGTTTCTT GGATACTGTT CCAGTAGATG ATATTGTTCG TTTCGAGGAA GAGTTCCATG 480 

CCTTCCTTTG ATGCTCAACA TCCAGAGATT TTGGAAACCA TTCGTGATAC AAAAGACTTG 540 

CCAGAAGAAG CAGTCTTGGA TGCTGCGATT ACAGAGTTTC TCAATCAATC TAGCTTCCCA 600 

ATAAGAATAG AAGTGTCAGA TGGCAGTGTC TCTAAATGAT ATTAAAACAA AAATCGCCTC 660 

AACAAAAAAT ACGAGTCAAA TCACTAATGC CATGCAAATG GTATCGGCTG CTAAG 715 

(2) INFORMATION FOR SEQ ID NO: 160: 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1087 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) .TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 160: 



ATCAAAGAGC CTATGTGCAG GCTGGGGCAG GGATTNTCTA GATCTATCGC CCAAAACGAA 
TACCAAGAAA CCATTAACAA AGCAAAATTT ATGACTAGAA TTGGAGAACT AAGACCATGA 
TTTTATTGAT TGACAACTAT GATTCTTTTA CCTATAACTT GGCCCAATAC ATTGGGAATT 
TTGCAGAAGT TCAGGTTCTG AGAAATGATG ATTCCAAGCT GTATGAAGAA GCTGAAAAAG 
CAGATGGTCT GGTCTTTTCT CCTGGTCCTG GTTGGCCAGT TGATGCTGGA AAGATGGAAG 
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ACATGATTCG TGATTTTGCA GGCAAGAAGC CGATTCTTGG GATTTGTTTG GGCCACCAAG 
CCATTGCAGA AGTCTTTGGT GGTAAGCTAG GTTTGGCTCC AAAAGTCATG CATGGGAAAC 
AGAGCAATAT CAACTTTGAA GCGCCATCTG TTTTGTATCA AGGTATTGAG GATGGCCGTC 
CGGTCATGCG TTATCACAGT ATTTTGATTG AGGAAATGCC AGAAGACTTT GAAGTGACAG 
CTCGTTCGAC TGATGACCAA GCCATCATGG GGATTCAACA TAAAAACCTA CCGATTTATG 
GCTTCCAGTA CCATCCAGAG AGCATTGGAA CGCCAGACGG CTTGTCTTCT ATTCGGAATT 
TTATCGAGGA GGTTGTAAAG TGAGGAAACT AGGATGAAAG AGATTATTGA AAAACTAGCA 
AAATTTGAAA ATTTATCAGG TGTGGAAATG ACGGATGTCA TTGAGCGTAT CGTAACTGGG 
CGTGTAACTG AAGCGCAGAT TGCTTCTCTC CTCTTAGCTC TTAAGATGAA GGGGGAAACA 
CCTGAAGAAC GCACAGCCAT TGCCCAAGTC ATGAGAGGAC ATGCCCAGCA TATTCCAACT 
GAGATTCATG ATGCTATGGA CAACTGTGGT ACAGGTGGGG ACAAGTCTTT CAGTTTTAAT 
ATTTCCACAA CTGCAGCCTT TGTCTTGGCT GGTGGCGGTA TTCACATGGC CAAGCATGGT 
AACCGCTCGA TOM ATCTGGTTCC GCAGATTCCT CNAACCTTGG GAATCAATCT 10 80 

1087 

(2) INFORMATION FOR SEQ ID NO:161: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1089 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(xi> SEQUENCE DESCRIPTION: SEQ ID NO:16l: 

CTCTGCTTGC TTTTGTTGCT TGGAATTTTG TCACCTTTTT TCTT^CCCA CCTACTCGAA 
ATGAAATTTC TTATCAACAT GCTGCTCCTA CTTTTACAGG AGTTACGGCA TTTTTGATGT 
ATTTTTTTTA TCCTGTGATT GCAGGTCCCA TTTTTGAAGA CATGATTTAT CGTGGATTAG 
TGATGACTGC TCTGGAAAAA GGAAAGAAAT GGGGGCTAGA TGTACTTGGT TCCGCTGTTT 
TATTTGGGGT CTCGCACATT AGTAATCACG GTTGGGTCTT GACAGACTTT GTCTTCTATA 
TGGGTGGAGG TCTTATATTT GCAGTCTTAT TTAGAATGAC AAAGTCAATT TATTGGCCTA 
TTGGACTGCA TATAGTCTAC AATGGCATTG GTCAGCTTTT GATGTTACTG TAATTTTGGT 
^TTAGTAGT ATCGCGGTCT ATCCTTGGGG AAAGCTTCCC ATTTGAATTT AAGAAGGAGT 
TCTGATGAAA GTATTTCTTC AAAATAGAGA TTTTAGACAA TTAACCATCA ACCAGTCGAT 
TTCAACGCTT GGGGATACGA TTTTTTATCT GGCCTTCCTG AATTATGTGG CAGATACATC 
TTTTGCCCCT TTGGCGATTT TACTCATCAC GATTTCAGAA ACCCTGCCCC AAGTTCTACA 
AATCTTTCTG GGAGTTTTGG CGGATTTTCA ACATCATCGT GTCTTGAAAT ACACAGTCAT 
TAGTTTTGCA AAATTTTTGC TTTACTCTAT AGTGTCCCTA TCGCTTTCAG GGCAGTCTTT 
TTCCTTGTTG CTAGTAACCT TTATTTGTCT GCTTAACCTC TTGTCTGACA CATTGAGTGC 
TTTTTCAGGC GCCATGCTCA CTCCGATTTT CATTAGAATT ATTGGGCAAG ACCATCTGGC 
AGAAGCTATT GGATTTAAAC AGTCAACTGT TAGTTTAGTG AAAACAACCA GTAATATCCC 
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AGGAAGAATC TTACTAGGCA TTCTATCCAT CCAGTTTATT TCCTTACTGA ACGCTCTTAC 
CTTTTTAATC GCATTTTTAG GTATCCTCTT CATAAAAACT GACCTCTTGA AAGTAGAAAA 
AACGATTAG 

(2) INFORMATION FOR SEQ ID NO: 162: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1269 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 162: 

ATTCAAGGTT CATTCGCGGA CCAACATCCG TTCACTTTCA TTTGCCATCG GTGGATGGTG 
CCCTTCCAGG AAATGGAGGG TCGTGGTTAT GTCCTTTCGT CGTCTTCTCC GTCGTGCTTC 
TATGCATGGG TCAAAAATTG GGTATCAACG AGCCTTTCCT TTACAAAACT CGTTCCAACT 
GTTGGAAAAA TCATGGAAAG CTACTACCCA GAAGTGCTTG AGAAACGTGA CTTTATTGAG 
AAAATCGTTA AGAGGCGAAG AAGAATCATT TCCCCGTACC CTTCACTCAA GGTCAACACT 
TTGCCCAAGG CATTGTAGCA GACTTGAAAG AAAAAGGTCA ATCTGTTATC GCTGGTTCAG 
ATGTCTTCAA ACTCTATGAT ACTTATGGGT TCCCAGTTGA ATTGACTGAA GAAATCGCTG 
AAGAAGCTGG TATGACTGTA GACCGTGAAG GTTTTGAAGC AGCCATGAAA GAACAACAAG 
AACGCGCGCG TGCGTCAGCT GTCAAGGGTG GCTCAATGGG TATGCAAAAT GAAACTCTTC 
AAAACATCAC TGTAGAAAGT GCCTTCAACT ACAATGCTAG CCAATTGTCT TCTAAATTGG 
TAGCTATCGT GGCGGACAAT GCAGAAGTAG AAGCTGTTTC AGAAGGAACT GCCTCTCTTA 
TCTTTGCGGA AACGTCATTT TATGCTGAAA TGGGTGGACA GGTAGCTGAC TACGGACAAA 
TCTTGGATGA GTCAGGTAAG GTTGTGGCTA CTGTGACCAA TGTTCAGAAA GCCCCAAATG 
GTCAAGCCCT TCATACAGTT GAAGTCCTTG CACCGCTTGC CTTGAACCAA GAATATACCT 
TGGCAATTGA TAGCAATCGC CGTCACCGTG TCATGAAAAA CCACACTGCG ACTCATTTGC 
TTCACGCTGC CCTTCACAAT ATCCTTGGAA ACCACGCAAC ACAGGCAGGA TCTCTTAACG 
AAGTTGAATT CCTTCGCTTT GACTTTACCC ACTTCCAAGC TGTAACTGCT GAAGAATTGC 
GTGCGATTGA ACAGCAAGTC AACGAAAAAA TCTGGGAAGC TCTTGAAGTT AAGACAGTTG 
AAACGGATAT TGACACTGCT AAAGAAAATG GGAGATCGAA TTCCTGCAGC CCGGGGGGAT 
CCACTAATTC TANANCGGCC GCCACCGCGG TGGAACTCCA CTTTTNTTTC CCTTTAATGA 
AGGTTAATTG CCNCCTGGGG TTANCATGGC CCNACCGTTT CCCGTTTTAA ATTGTATCCC 
CCCCNAATC 

(2) INFORMATION FOR SEQ ID NO: 163: 



1020 
1080 
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(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1243 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: Genomic .DNA 



(xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 


163: 






CCGTTGATAT 


AGGCTTGGGC 


ATTGCCACAG 


AACTTCTTAC 


CATCAATCTC 




OU 


CGGACCTGTG 


AACTCAGCTT 


TAACCCCGAG 


TTGAGCCAGG 


GTATTGATAA 


CCGGAGTTGA 




GAAGCTCTTG 


AAGTCAAATG 


CTTATTTTCA 


TCTTCTTTGG 


AGATGATCGT 


GTAGTTGAGG 


1 PA 


TTATTTAAAT 


CGTGGTAAAC 


AGCTCCACCA 


CCACTAATAC 


GGCGAACTAC 


CTCAATACCA 


1 A ft 


TTTTCGCGAA 


CATAATCACG 


GTTGATTTCT 


TCGATAGTGT 


TCTGGTGACG 


ACCAACAATG 


JUU 


ATAGATGGCT 


TGTTAATCCA 


AAGTAGGAAG 


ATTTGATCCT 


CATCCAAAAG 


GTGTTTAAAG 


J oU 


GCGTATTCTT 


CCAAGGCAAT 


ATTAAAAGCA 


GTGTCATTTG 


AATGATTGAT 


AATGTATTTC 


a *> n 


ATGATATCCC 


TTTACTTTAT 


ATGATAGAGA 


CTGGAAATAA 


CCTTCCAGTC 


TAATCTATCT 




TCGTTTTATT 


TTTTCTTAGG 


TGAATGGATG 


GCCATTCCTA 


GAACATCTGC 


AAACGCTTCG 




TACATCACTT 


CAAAGTAAGT 


TGGGTGTCCG 


TGGATGGTCT 


TCAGCATTTC 


CTCAACAGTG 




ATTTCCATTT 


CGATGATGCT 


TGATGCTTCG 


TTTATTAATT 


CTGCGGCTGC 


AGGAACAATA 


660 


ATGTGTACAC 


CAAGGATTTC 


TCCGTATTTC 


TTATCAGCGA 


TAACTTTTAC 


GAAACCTTGA 


720 


GCTGCGTCAG 


ATGCAATAGC 


ACGACCGTTA GCAGCAAAGT TAAACTTACC GATGGCAACA 


780 


TCGTATTTCT 


CACGGGCTTG 


TTCTTCTGTC 


AAACCTACTG 


CTGCTACTTC 


AGGGAGAGTG 


840 


TAGATGGCTG 


CAGGAGTCAA 


ATTCAATTTG 


GCAACTGCAT 


GATTTCCTTT 


AAGGGCATTT 


900 


TCAGCGGAAA 


CTTCACCCAT 


GCGGAAAGCT 


GCGTGAGCCA 


ACATCTTAGT 


ACCGTTGATG 


960 


TCACCTGGTG 


CATAAATGCC 


TGGAACTGAA 


GTTTCCATGT 


ATTCATTGAC 


CTTGATACAA 


1020 


CCACGATCCA 


ATTCAAACTC 


AACCTCTCCA 


ATACCTTCAA 


GGTCTGGCAT 


ACGACCAATT 


1080 


GAAAGAAGAG 


CTTTGCTTGC 


GATGATATTG 


TCTTTTCCTT 


CAACCTTGAT 


ACGAAGTTGA 


1140 


CCATTTTCCT 


CAATGATTTC 


TTGCAGTTTA 


GTACCAGTCA 


AGATGGTCAT 


TCCTTTACGC 


1200 


TCAAGGATCA 


AGCGAAGGTT 


CTTAGAAACT 


TCCGCATCCA 


TAG 




1243 



(2) INFORMATION FOR SEQ ID NO: 164: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 679 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:164: 

CTGATAGTGG AGACTATCTT GGGTATTCTA GGTAGTAGAG GTATCCATTT AGCAGTTCAA 60 
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(2) INFORMATION FOR SEQ ID NO: 165: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1024 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 165 : 

AGCTCTCTCT CAGCCATTTT TTCAATATCT GCTACGATTT GTTCCTTGGC ATAGGGAGGG 
TCTAAGAAAA CGAGGTCAAA TTCCCCAGAN AACCTGTTCC AATGCCCTTT CTGCATCCAT 
TTTGGAGGAG TTGAAATTTT CCAACTTCCT TGGTCATCTG GATATTTTCA GCCACGGATG 
GTCTGAGCCT TACGGTCTCG CTCCACCAAA ACAGCACTGG ACATGCCACG CGATACTGCT 
TCGATAGATA AACCACCACT ACCTGCATAA AGGTCCAAGA CTCGTCCCAC TTCAAAGTAG 
GGACCAATCA TGTTAAAAAT GGCTCCCCTA ACCTTATCCG AAGTAGGTCT TGTTGTCTTG 
CCTTCTAGTG TCTTGAGGGG ACGTCCCCCA TAGATTCCTG ATACGATTTT CATACTGTTT 
ATTATACCAA ATTATAGACA AAAAGAGAAA GAAAACCGAA CCTTGCGGTT CGATTCTCTA 
CAAAATATTT TCGTAAGTAT CGCGGACTTA CTTGAGGCCA AACACTTGTT TGCACTTCTC 
CGATGTGTCT CTTGCGAAGT AGGAACATGG CCATACGAGA TTGTCCAATT CCTCCACCGA 
TTGTCAATGG GAATAGGCCA TTCAACAAAG ACTTGTGCCA TTCCAATTCT AAGCGGTCTT 
CATCACCTGT AATTTCCACC TGACGTCTAA GAGTTTCTTC ATCTACACGA ATTCCCATAG 
AAGACAACTC AAAGGCTCCA CCTAAAGACT CATTCCAGAC AAGAATATCA CCATTTAGAC 
CCTTGTAGCC ATTCTCAGAC TCGCTTGTCC AGTCATCATA GTCTGGTGCA CGTCCATCGT 
GCGGTTTACC ATCTGGCAAC TCGCCACCGA TACCAATCAA AAAGACGGCT CCAAATTCCT 
TTACAAATCG CATTTTCCAC GTTCTTTAGT GTCAAGTCTG GGTAGCGTTC TACCAATTCT 
TCTGTATGGA TAAAGGTGAT TTGTTTTGGC AAGATAGACT CGATGTCATA GCGGGCTTCA 
ACAG 
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GGTTGTGAGC ATGTCAATCG GGCTCTCGTT GTTGAACGTC AGGTGGCAGA GCAGTTTGAT 120 
CTGGAAATTG TCAGTGTCCA TCCTACTCTT CATGCAGGAG GTTCGGGGCA GTTGGCAGCC 180 
TTTAAGTTTA TGCAGGATCC AGTTGAGGTT GAATTTATCA AGGCTCATGC TGGATTGGAT 240 
ATCGGAGACA CTGCAATTGG CATGCATGTC AAGCATGTTC AGGTTCCGAT TCGCCCTATT 300 
TTGAGAGAGA TTGGTCATGC CCATGTAACG GCACTGGCTA GTCGTCCAAA ATTAATCGGA 360 
GGTGCGCGTG CGCACTATCC GCAAGACGCT ATTAGAAAGT CTTGAGAATG AATAAAGGAG 420 
ATAAAAGTGT TTAAAAAAGA CCGTTTTTCA ATTCGTAAGA TTAAGGGAGT TGTAGGCTCT 480 
GTATTTCTTG GAAGCCTTTT GATGGCTCCT TCTGTAGTGG ACGCAGCCAC CTATCACTAT 
GTAAATAAAG AGATTATTTC ACAAGAAGCT AAAGATTTAA TTCAGACAGG AAAGCCTGAC 
AGGAATGAAG TTGTATATGG TTTGGTGTAT CAAAAAGATC AGTTGCCTCA AACAGGGACA 
GAAGCATCTG TTTTGACAG 
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679 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1024 



WO 97/43303 



PC7YUS97/07950 



(2) INFORMATION FOR SEQ ID NO: 166: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 702 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 166 : 



CTTTTTTATT 


TCACAACAAG 


TTCATAACGT 


GTCTTACTGG 


TGAAGGTTTG 


ACCAGCTTTA 


60 


AGAATGACTT 


GGCCTTTAAG 


GTCACTGTGA 


ATGGCATCTG 


GTAAAGCTTG 


CGCTTCAAGA 


120 


GCAATCCCAT 


TGTGCTGTAG 


CATTGGCTGA 


CCTCCTATGA 


TGACACTTTC 


ATCCACAAAG 


180 


TTTGCTGTGT 


AGACCACAAA 


GCAAGGAGCT 


TCTGTCTTGA 


AAAGCAGGAA 


GCGACCTGAA 


240 


TTTTGGTCAT 


AAAGGAATCC 


AGCATTGTCA 


TGGCCTGCAG 


GAAGGGCAAA 


TGGATGATCC 


300 


AAACCTGATG 


CCAGCTGGAT 


TTGCTCATCT 


TCTTCTGCAA 


AGATATCCTT 


CAACAAGGCA 


360 


CCATTGTAGA 


TGTGTTTGAC 


CACATCACGG 


TTGGCTTCTG 


GAGTTTTGGC 


AGGAACACCG 


420 


TCAGGAGCGA 


TTGAGTAAAT 


GCCCTCTGTG 


TTTAGTTGGA 


AGACATGACG 


GTCAATCGTC 


480 


TGCGTGAAAT 


CACCAGACAA 


GTTGAAATAG 


CTGTGGTTGG 


TTGGATTGAC 


CAGCGTATCC 


540 


TGATCGGTCG 


TTACCTTGTA 


GATCGAATTC 


ATGGAGGCAC 


CAGTTTCTTC 


CAAGTGATAA 


600 


CTGATCCAAA 


TCTTGAGATT 


TCCAGGGAAC 


CCTCCTGTCC 


CATCTGTACG 


CTCTGTGTAG 


660 


AGAGTCAAGC 


CATGATCGCT 


TACTTCTTCA 


ACTTCAAACA 


AG 




702 



(2) INFORMATION FOR SEQ ID NO: 167: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 677 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 167: 



CTCAATATTT 


CTTGTCCCAA 


TGTTGACCAC 


TGTAATCATG 


GACTTTTGAT 


TGGTCAAGAT 


60 


CCAGATTTGG 


CTAATGATGT 


GGTGAAAGCA 


GCTGATTGGA 


AGCCTCGAGT 


GCCAGTTTAT 


120 


GTCAAATTAA 


CCCCGAGTGT 


GACCGATATC 


GTTACTTTGC 


AAAAGCTGCA 


GAAGATGCGG 


180 


GAGCAAGTGG 


CTTGACTATG 


ATCAATACTC 


TGGTGGGAAT 


GCGCTTTGAC 


CTCAAAACCA 


240 


GAAAACCAAT 


CTTGGCCAAT 


GGAACAGGTG 


GAATGTCAGG 


TCCAGCAGTC 


TTTCCAGTAG 


300 


CCCTCAAACT 


CATCCGCCAA 


GTAGCCCAAA 


CAACAGACCT 


GCCTATCATT 


GGAATGGGGG 


360 
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GAGTGGATTC GGCTGAAGCT GCCCTAGAAA TGTATCTGGC TGGGGCATCT GCTATCGGAG 420 

TTGGAACAGC TAACTTTACC AATCCTTATG CCTGCCCTGA CATCATCGAA AATTTACCAA 480 

AAGTCATGGA TAAATACGGT ATTAGCAGTC TGGAAGAACT CCGTCAGGAA GTAAAAGAGT 540 

CTCTGAGGTA AACTGCAATC AATCTGTTCT TGATTTTTTA TTAGTTTGTA ATATGAATTT 600 

AGGAGAATTT TGGTACAATA AAATAAATAA GAACAGAGGA AGAAGGTTAA TGAAGAAAGT 660 
AAGATTTATT CTTTTAG 



(2) INFORMATION FOR SEQ ID NO: 168: 



677 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1047 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 168: 



ACGCATTTCA TTCAACGGAA ATGGAGACTA CCCTTGGGTT GGCGTAGGCT TGCCAAAGTT 60 

TTCTGGCTAG TTCTGGCCCC ATAGGGACCA TATTGAACAA AATCAGCTTC ACGTTCACCG 120 

TTTTGGTCTT TGTAACGACG GTTCACAGCG ATAGTTGCTC GCGCTACCGA CTTGTCATTG 180 

TTGGTTTTGT GCAATTCTGG TGTAGACGTT AAACGCCCAA TCATGATAAC TTTATTATAC 240 

ATATTTTCTT CCTCCTACTT ATCTATTCGT AGGAAATCAA AAAAGTTACA GAAATTTGTA 300 

ACTTTTCGAG GAAATTTCTT ATTTTTTATG AATCATGAAA CCTCTCGCCT GTTGATTGGC 360 

CATAATGGTC ATATCTGTAA TCTGAACACG GCGAGGCTGA CTGGTCACAT AGACTACTGT 420 

GTCTGCAATA TCCTGAGCTT GCAAGGCTTC TATTCCTTGG TAAACGGACG CAGCCCGCTC 480 

TTTATCACCA TGAAAACGAA CAGTTGAGAA ATCTGTTTCG ACAATCCCAG GCTGAATGGT 540 

TGTCACCTTG ATATCCGTTG CGATGGTATC AATTCGCAGT CCATCCGAAA AGGTCTTAAC 600 

CGCAGCCTTG GTAGCTGAGT AAACAAGGCG GGCACCCAGG CATAGGCGGT AGGATTCCTG 660 

GCGGGTTGAA CCCCATATTT TGGATAAACG GGGACCTTGG ATTGAACTTT TAACCATTGG 720 

CTGGGCAAGG AAACAGCGGA GTAACCGCCA TCAGAACCTT TAACATTGGT ATCCAACATG . 780 

GTCAACATAT CCAACTCCTC ATAGTCTTGA TAGGGAGCCA AGCCAAGAGC CAGTCCGGCG 840 

TTGTTGACCA AGATATCAAT CCTCCCTATC GTTTCTAGAA TATCAGAGCA AACAGTCTTC 900 

ACCATGACCA TATCCGTCAC ATCTAGTGGA AAAGTCCAGA CGGTTTGATT TGGAAAGCTT 960 

ACTGCAAACT CCGACTTGAA GGATTTCTAA TCTGTCTATC CGTCGTCCTG TTAGAACGAC 1020 

ATCCTCACCC TGCTCCAGAT AAGCACG , n An 



(2) INFORMATION FOR SEQ ID NO: 169: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1165 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 169: 

CTTGTTGGAT ATAAGGATTG CGAGTTGCAT AGCGAAGAAC AATAATCGCG GCAAATAGCC 60 

CATAAAGAGA GGTAGAGGCG CCTGCTGCTA AGGATTTAGG ACTAAATACA AAAACAAAGA 120 

GATTGCCCAT CATTCCTGAT AAAAGATAGA GAAAGAAAAA CTGCTTAGAA CCGAAAATCT 180 

CATATACCTG CATTCCAAGA TAATAAAGTG AAAGCATATT AACAATGAAA TGTTCCCACC 240 

CAATATGAAC AAAAATGGCA GACAAGAGAC GCCAAACCTG CTCGGGAAAG AGGCGAATAG 300 

CTGGACCATA CATGGCTCCA AATCGAAATA ATGTATCTGC CCTGTCAAAG TTTCCGCCTG 360 

CAGTGACCAA CATTAGTAAA AATACCAAGG CCGTCACTAA GAGGAAGAAA CTCGTCACAG 420 

GGTAACGTCT ATCAAAGATT TCCTTCATCA ATTAATACCT CCTGAACAGG AATATCATGG 480 

TTTTCAGGTA TAAAGTCCTG AATTTGACAA GGATATATCG TACTCAAAGC ACGACCAGAA 540 

AAATGTTCCA GATAGCGGTC ATAATAGCCT CCACCGTATC CTATCCGATA TCCTTTCGTC 600 

GTAAAAGCCA GACCAGGAAC ATGAATCAAA TCAATCTTTG AAACATCCAC AACTTCCAAA 660 

TCTCCCTGTG GCTCCAGTAA TCCAAAGGAA GTTTTTACCA ACTGTTGCGG ATCATAGACC 720 

ACAAAGTCCA TGCGCCCCTT GGGATAGGTC TTAGGTGTCA AAACCCTCTT GCCGTCCTTC 780 

AGCGCCTGCT CAATCAGTTC CCGCGTTTGA AACTCATGAG AAAAAGAGAG GTAGGTTGCG 840 

ATGACCTTGG CTTCTTGATA AAAGGGGTGT TGTAAAATTC GCTCGGTTAA AGCTTGGTCT 900 

ATAGCCTGTT TTTGTTCTTG AGATAAAGCC TTCATTTCAT GCAAGACTTG CTTGCGTAAT 960 

TCCGATTTCA TAGACAAGCC CTCTATTCTG CTGCCTTCTT TTTCAGGAAA CTAGACACCG 1020 

CAGCCACCCC AATAGCTAAG ACTTCTTCCT TAGGACTCAT TTGAGGGTGA TGAAGAGCGT 1080 

AGGGACTATC GATACCTAGC CAAAACATCA CACCATCAAC CTTTGAAAGG AGATAACCAA 1140 

AGTCCTCGCC TGTCATAGCA GGTTC 1165 

(2) INFORMATION FOR SEQ ID NO: 170: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 662 base pairs 

(B) .TYPE: nucleic acid- 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 170: 

CTGGGCGACC TTGAGAGTGG CTTTACTGGG GCTGATTTTG GGAGTTTTGA TTGCCTGTCT 60 
TATGGCTGTA CTCGATGGAT AGTTTGACTT GGCTCAATGA CCTGATTTAC CCTATGATGG 120 
TGGTCAATCA AACCATTCCC ACCATTGCCA TAGCTCCTAT CCTGGTCTTG TGGCTAGGTT 180 
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ATGGGATTTT GCCCAAGATT GTCTTGATTA TCTTAACGAC AACCTTTCCC ATCATCGTCA 240 

GCATTTTGGA CGGTTTTAGG CATTGCGACA AGGATATGCT GACCTTGTTT AGTCTGATGC 300 

GGGCCAAGCC TTGGCAAATC CTGTGGCATT TTAAAATTCC AGTCAGCCTG CCCTACTTTT 360 

ATGCTGGTCT GAGGGTCAGT GTCTCCTACG CTTTTATCAC AACAGTGGTA TCTGAGTGGT 420 

TGGGAGGCTT TGAGGGGCTT GGTGTTTATA TGATTCAGTC CAAGAAATTG TTTCAGTATG 480 

ATACCATGTT CGCTATTATT ATTCTGGTAT CGATTATCAG CCTTTTGGGT ATGAAGTTGG 540 
TCGATATTAG TGAAAAATAT GTGATTAAAT GGAAACGTTC GTAGAATTAG AATGTTTCTG 

AAAAAGAAAA GAGGAAATCA AAATGAAGAA AACATGGAAA GTGTTTTTAA CGCTTGTAAC 660 
AG 



600 
660 
662 



(2) INFORMATION FOR SEQ ID NO: 171: 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 555 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic UNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 171 : 

GGAATTCGAT CTCTTAAAGC CTATTATGGT TAATCCAATT TGTTTTAGCT GCAGGTGGAA 60 

TTATGAATGT CGGATATGAA AAAGCATTCT TGATGCAGAC ATCGTTAAAT TTGCCAACTT 120 

CTGAAATTAT CTCGACATAT GTCTATAAAG TTGGTCTTGT ATCAGGAGAC TATTCTTACT 180 

CAACAGCGGT TGGTTTGTTT AATGCAGTGA TTAACGTAGT ATTGCTTGTT GCAGTTAACC 240 

AAATCGTTAA ACGCATGAAT AATGGTGAAG GAATTTAAGG AGGAAAGTAT GAAAAATTCG 300 

ATTATGGATA CAAAATTTGA TAGACGTATC TTACTCTTAA ATAAAATCAT TATTGTCTTT 360 

ATCGTTTTGA TGACTTTGCT TCCTTTACTT TATATCGTCG TAGCATCCTT TATGGATCCT 420 

AAGGTTCTGG TTAGTAGAGG GATTAGCTTT AATCCAGCCG ATTGGACTGT AGAAGGTTAC 480 

CAGCGTGTAT TCAGTGACCA ATCTATTCTA AGAGGTTTTA TCAATTCTCT ACTATACTCT 540 
TTTGGATTTG CAGCT 



555 



(2) INFORMATION FOR SEQ ID NO: 172: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 860 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 172: 



CTCATAATCA TATTTTGGAT TCGCAAATTG AGGGAGTTAT TTCAACGGCC GATATTATTG 60 

ACAAAGCTGG AATCACTCCA ATCGGAGTTT ATACGCACGA ACCACGTGAT CAGGTTCCGC 120 

TGGTCATTAA GGAAGTGAAT GGTATCAAGG TTGCATTGTC AGCCTATTCC TATGGTTTCA 180 

ATGGAATTGA GCAGTATATT TCTCAGGAAG ACTATAATCG TTATCTTTTA GATTTAAACG 240 

AAGATAAGAT GAAGGCTGAA GTTGAACGGG CAGAGAAGGA AGCAGATATC ACCATTATCA 300 

TGCCTCAGAT GGGTGTTGAG TATCGATTGG AACCAACTGA AGAACAAAAA GCTCTTTATC 360 

ACAAGATGAT CGATTGGGGA GCGGATATTA TCTTTGGAGG GCATCCTCAC GTTGTTGAGC 420 

CATCTGAAAC GGTTGAAAAA GATGGAGATA AGAAACTCAT TATCTATTCA ATGGGGAACT 480 

TCATTTCCAA TCAACGAATT GAATCTATGG GAGATGAAGA GAATGCTAAG TGGACTGAAC 540 

GTGGTGTTCT CATGGATGTT CACCATCAAG AAGAATGATG GAAAAACAAC TATCGGAACA 600 

GCTAAAGATC ATCCTACTTG GGTCAATCGA ACACCAAAGG GAACCTTTTC ACCAGAAGGA 660 

TATCCCTTGT ATCATTACCA AACTTATATT TTGGAAGATT TTATAGAGGA TGGCAGTCAT 720 

CGTGACCAGT TAGATGAAGC GACTACCGGA ACGAATTGAT ACCCCTATAA AGAAATGAAT 780 

GAACATGTGG GATTGAAGTG GTATTACTTT GAATCCCGAG GAAAGTCCTG ATGATTAAGG 840 
TAATTGCGAC AGATATGGAT 



860 



(2) INFORMATION FOR SEQ ID NO: 173: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1284 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 173: 



CTCATTTGCA 


AAATCAGGAA 


AAACGGATGG 


TAACGGCAGT 


CCGAAATGTT 


CTATCTAAGA 


60 


AACAAGAGGC 


TTTGAAAAAA 


TGCAGTCAGT 


CTGTTATCTT 


TAGACAACCT 


GAGCGCTTGT 


120 


ATGACGGTTA 


TTTGCAACGC 


TTGGACCAAC 


TGCAACTGCG 


TTTGAAACAA AGTTTGCGAA 


180 


CTCGGATTTC 


TGATAACAAA 


CAATTAGTTC 


AAGCAAGAAC 


TCATCAATTA 


GTACAATTAT 


240 


CACCTGTTAC 


CAAAATCCAA 


CGCTATCAAG 


ACCGTTTAGG 


ACAGTTGGAC 


AAGCTTCTTA 


300 


GGTAGCCAAA 


TGGCGTTAGT 


TTATGACGCC 


AAGGTTGCTG 


AGGCCAAGCG 


ACTTTCGGAA 


360 


GCTTTGCTCA 


TGTTGGATAC 


TAGCCGAATC 


GTGGCGCGTG 


GTTATGCTAT 


TGTCAAAAAA 


420 


GAAGAATCCG 


TTGTAGATTC 


GGTTGAGAGT 


TTGAAGAAAA 


AAGACCAAGT 


AACGCTTTTG 


480 


ATGCGAGATG 


GTCAAGTAGA 


ATTAGAGGTT 


AAAGATGTCA 


AAACAAAAGA 


AATTTGAGGA 


540 


AAATCTAGCA 


GAACTGGAAA 


CCATTGTCCA 


AAGTTTGGAA 


AATGGTGAAA 


TTGCTCTGGA 


600 


AGATGCGATT 


ACTGCCTTTC 


AAAAGGGCAT 


GGTCTTGTCA 


AAAGAGCTCC 


AAGCTACGCT 


660 


GGACAAGGCT 


GAAAAGACCT 


TGGTCAAGGT 


CATGCAAGAA 


GACGGAACAG 


AAAGTGATTT 


720 


TGAATGAAAA 


AGCAAGAAAA 


ATTAGCTCTT 


GTCGAGTCGG 


CTTTGGAAGA 


TTTTATGGAG 


780 
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ACCAGCAGTT TGCCTCTAGT TTACGGGAGT CTGTTCTCTA TTCTATTCAT GCTGGTGGCA 
AGCGTATTCG GCCTTTTCTC TTGTTAGAAG TTCTGGAAGC CTTGCAGGTT ACCATCAAAC 
CTGCTCNCGC GCAGGTAGCT ACTGCCTTGG AGATGATTCA TACAGGGAGC TTGATTCACG 
ATGACCTTCC TGCTATGGAT GATGACGAGG ATCGAGAGAG GGCGGAAAAA CCAATCACAA 1020 
GAAATCCGGT GAAGCTATGG CCATCCTAGC TGGAGATGCC TCATGCTTAG ACCCATATGC 1080 
CTTGATTGCG CAGGCAGATC CGCCAAGTCA GATCAAGGTG GGCTCGATTG CCAACTCATC 1140 
CCTTGCTTCA GGTAGCCTGG GTATGGTGGC AGGGCAAGTC TTGGATATGG AGGGCGAACA 12 00 
CCAGCACTGG TCTCTGGAAG AACTTCAGAC TATGCATGCC AACAAGACTG GGAAGTTACT 1260 
AGCCTATCCC TTCCAACGCG GCAG 

1284 



(2) INFORMATION FOR SEQ ID NO: 174; 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 978 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 174: 



CCACTATCAA GCATCTCTCT TGCAAACTTT ATCGCACTAT CTCCATCACC TTTATAATAT 60 
ACATGAATAG TCAATGTCAT CTTATATCCT CCAAAATCAT CCTTCAATTT TTAAAAAACA 120 
AGTTTAGATG AGGATCTAAA CTTGTTTTTT ATGAACTAAT TATCTAACGT TTCGCCATTA 180 
CTTTCAATCA CTTCTTTATA CCAATAAAAT GATTTTTTCT TATAGCGATT TATAGTCAAT 240 
TGAAACAAGA ACAAGACAAA AGCACCCATA AGGTAAACTT TTGAGTGCTT TGAAACGTTG 300 
ATATAATCGT ATTGATTAAC GTTTTGAGGC AACTGACCTT GTCAGGTTGC AGCCACATTT 360 
GTAAGCGACT AAAGTCGCAA CAACTGTGTC AATTGCACCA ATTTAGTAAG AAAGTATAAA 420 
AAAAGAACAC CCCGAAAGGT GCTCTGTTAT AAGTTATAGT AATTCTTTCG AATTAACGTT 480 
TACTAAATTG TGATGCTTTA CGAGCTTTCT TAAGACCTGG TTTCTAACGT TCAACTTTAA 540 
CGGGAGTCAC GTGTAAGAAG TCCTGCGCGT TTCAATGAAT CGCGGAAGTC TGGGTTTACT 600 
TGAAGAAGGG CACGAGCGAT ACCATGACGG ATAGCTCCTG ATTGACCAGC GTATCCACCA 
CCTACAACGT TAACGAAAAC GTCGTATGAA CCTACAGTTG AAGTAACTGC GAATGGTTGG 
TTGATGACAA GACGAAGGTC AGCGTGTGGG ATGTACTCTT CAACATCTTT TTTGTTAACA 
GTGATTTTAC CACTTCCTGG AACAAGCCGA ACGCGTGCAA CAGCGTTTTT ACGACGTCCA 
GTACCTGCAT ATTGTGCTTG TGACATACTT TATTGTTCCT TTCCTTAGAT AAGTCCTGAA 
ATGTCAAGAA CTTCTGGTTG TTGTGCAGCG TGAGTGTGCT CAGCTCCAAC AAATACTTTC 
AACTTCATAC CTTGAGCG 



660 
720 
780 
840 
900 
960 
978 



(2) INFORMATION FOR SEQ ID NO: 175: 



(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 874 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 175: 

CTTCGGGTTG GGCAGCTTAT TTTAAAGGCT TGCTCTCTCA ATACGGGATA GCCTTTCCTC 60 

AGGCTTTAAA TGGTACCTTT AATCCTCAAG CAGGGACATT TGTTGATTTA TTGCCTATTC 120 

TTGTCTTGGT CTTGGTGACT TCGCTTGTTT TACTAAATGC CAAGGCAGCC TTACGCTTTA 180 

ATTCGATTCT AGTTATTTTG AAATTTTCCG CTTTAGCTCT CTTTGTATTA GTAGGAATTT 240 

GGAATATTAA GTTTGATAAT TGGAGCGGAT TTGCTCCTTA CGGTTTTTGG ACAAATCTAT 300 

GGAGCTAGCA CTGGTATTAT GGCTGGTGCG TCCTTGATGT TCTTCGGTTT TTTGGGATTT 360 

GAATCCATCT CTAATGGGGG GTCTGTAGAT GAAGTCAAGA CTCCTCAAAA AAATATTCCT 420 

AGAGGGATTG TCTTATCGCT TTCTATCGTA ACCATTCTCT ATGCCTTGGT GACCCTTGTT 480 

TTGACTGGTG TGGTTCACTA TAGTCATTTA AATGTCGATG ATGCCGTTGC CTTTGCCCTT 540 

CGTAGTGTTG GGATTAGTTG GGCAGCCAAC TATGTGTCAT TAGTGGCTAT CTTGACCTTG 600 

ATTACAGTTT GTATTTCGAT GACCTATGCC CTATCGCGTA TGATTTACAG TTTAGCAAGT 660 

GACGGCTTAG TGCCTGCTGC CTTTAAGGAA CTGACGAAGA CTAGCAAGAT ACCAAAGAAT 720 

GCTACTATTT TGACAGGTCT AGCTTCAGCA GTAGCAGCAG GAATGTTCCC ACTAGCCAGT 780 

ATTGCAGCCT TTTTAAATAT TTGTACCTTA GCCTAATTGA TCCTGCTGGC TTATGGTCTG 840 

ATTCGCTTAC GGAAAGAAAA AGGAATGCCC AAAG 874 



(2) INFORMATION FOR SEQ ID NO: 176: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 551 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17 6: 



CTAACCCTNG 


ACGGGGCCGC 


TATCATCAGT 


CAAACAGCTA 


AAAATCTTGT 


CTGCAAAAGT 


60 


CTCGATTAAC 


TGAGCTTTTA 


CAAAAGCCGT 


ATTTCCTGGA ATAACTTGGA GATTGATCAT 


120 


CTTATCCATC 


AATTCAGCCG 


ATTCGATATT 


GTCTTCAGCC 


AGTTGCAGAC 


TTTTTACGAT 


180 


TGATTTTGGC 


AATTCGTAGA 


CATAGGTGTT 


GTCTCTCAAA 


GGAATTTTGA 


CAATACCTAA 


240 


CTCTTTGATA 


TCTCGGGATA 


CCGTCGCCTG 


AGTGGCAGTG 


ATACCTGCTT 


CTTTCAAATG 


300 


TTCTACAATT 


TCTTCTTGCG 


TGCCGATTTG 


ATAATCTGTC 


ACCAATCTTC 


TAATTTTTTC 


360 
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AAGTCTCTCT TTTTTATTCA TTTTTAAATT GACTATGCGC CCTCTCTACT GCTTCTTTAA 420 

TCTCAGCAAG AATCTGATTG CTTGCTGACT TTTCTTTTTT CAAATACACT AAAAATTCAA 480 

TATTTCCATG TCCACCTTGG ATGGGAGAAA AGTCCAAGCC AAGGACTGAA AAACTGCCTC 540 

TACTGCCATA G 551 

(2) INFORMATION FOR SEQ ID NO: 177: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 774 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
(DJ TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 177: 

CTCTTACAGG AAGAAACTCG TCGTGACCTT GCTAAAGAAG TGAAGAAGTT CGGCGAAAAT 60 

GCTAAAGTGG CTGTCCGCAA TATCCGTCGC GATGCTATGG ACGAAGCTAA GAAACAAGAA 120 

AAAGCAAAAG AAATCACTGA AGACGAATTG AAGACTCTTG AAAAAGATAT TCAAAAAGTA 180 

ACAGACGATG CTGTTAAACA CATCGACGAC ATGACTGCTA ACAAAGAGAA AGAACTTTTG 240 

GAAGTCTAAA AATAAACAGA AAAACTCAGT TGGCATTGCT GGCTGAGTTT TATTCGAAAG 300 

AAGGAAATAT GAATACAAAT CTTGCAAGTT TTATCGTTGG ACTGATCATC GATGAAAATG 360 

ACCGTTTTTA CTTTGTGCAA AAGGATGGTC AAACCTATGC TCTTGCTAAG GAAGAGGGCC 420 

AACATACAGT AGGGGATACG GTCAAAGGTT TTGCATACAC GGATATGAAG CAAAAACTCC 480 

GCCTGACAAC CTTAGAAGTG ACTGCCACTC AGGACCAATT TGGTTGGGGA CGTGTCACAG 540 

AGGTTCGTAA GGACTTGGGT GTCTTTGTGG ATACAGGCCT TCCTGACAAA GAAATCGTTG 600 

TTGTCACTCG ATATTCTCCC TGTGCTCAAG GAACTCTGGC CTAAGAAGGG CGACCAACTC 660 

TACATCCGTC TTGAAGTGGA TAAGAAAGAC CGTATCTGGG GCCTCTTGGC TTATCAAGAA 720 
GACTTCCAAC GTCTCGCTCG TCCTGCCTAC AACAACATGC AGAACCAAAA CTGG 



774 



(2) INFORMATION FOR SEQ ID NO: 178 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1172 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 178: 
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CCTGCTCACG ATAAGGTCTT TGATGTANAA GTTCTTGTTG AAGGTAAGAG CATCGGTCAA 
GGCCAAGGTC GTTCTAAGAA ATTAGCAGAG CAGGAAGCTG CCAAAAATGC CGTTGAGAAG 
GGGCTGGATT CATGTATTTA AAGGAAATCG AAATTCAGGG GTTCAAGTCT TTTGCTGATA 
AGACCAAGGT CGTTTTTGAC CAAGGTGTGA CGGCAGTTGT TGGACCCAAT GGATCTGGAA 
AGTCCAATAT TACAGAAAGT CTGCGTTGGG CTTTGGGGGA GTCTAGTGTC AAGAGTCTCC 300 
GTGGGGGCAA GATGCCGGAT GTCATCTTTG CTGGAACCGA AAGTCGCAAA CCGCTCAATT 360 
ATGCTTCTGT AGTTGTGACT CTGGATAATC ATGACGGATT TATCAAGGAT GCAGGTCAAG 420 
AAATCAGGGT GGAACGCCAT ATCTATCGTA GTGGAGATAG CGAATACAAG ATTGACGGCA 
AGAAAGTCCG TCTGCGTGAT ATTCATGACC TCTTCTTGGA TACTGGATTG GGACGAGATT 
CCTTCTCTAT CATTTCCCAA GGCAAGGTTG AGGAGATTTT TAATTCTAAG CCTGAGGAAC DUU 
GCCGGGCTAT TTTTGAAGAA GCTGCAGGAG TTTTAAAATA CAAGACTCGT AGAAAAGAAA 660 
CTGAGAGTAA ACTGCAACAA ACTCAGGATA ATCTGGACCG CTTAGAGGAC ATTATCTACG 720 
AGTTGGATAA TCAAATCAAG CCTCTTGAGA AGCAAGCTGA GAATGCCCGT AAGTTTTTAG 780 
ACTTGGAAGG ACAACGTAAG GCTATTTATT TAGACGTTCT GGTTGCTCAA ATCAAGGAAA 840 
ATAAGGCAGA ACTAGAGTCG ACAGAAGAAG AGAGTTGGCT CAGGTTCAAG AACTCTTGAT 900 
GAGTTATTAC CAAAAGCGTG AAAAATTAGA AGAAGAAAAT CAAACTTCTA AAAAGCAACG 960 
CCAAGATTTA CAGGCTGAAA TGGCCAAAGA CCAAGGCAGT TTGATGGACT TGACTAGTCT 1020 
GATTAGTGAT TTAGAAAGAA AATTAGCCCT ATCGAAACTG GAGTCCGAGC AAGTGGCCCT 1080 
GAATCAACAG GAGGCACAAG CCCGTTTGGC TGCTTTGGAG GATAAGAGAA ATTCACTCAA 1140 
CAAAGAAAAG TCTGATAAAG AAAGCTCACC AG 



60 
120 
180 
240 



480 
540 
600 



1172 



(2) INFORMATION FOR SEQ ID NO: 179 j 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 597 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 179: 

CTGTTATTCC TGCTGCAGGG CTGGGCACAC GATTTTTGCC TGCCACTAAA GCTTTGGCAA 60 

AAGAAATGCT TCCAATTGTA GACCGCCCCA CAATTCATTT TGTCATTGAA GAAGCTTTAC 120 

GTTCGGGTAT TGAAGATATT CTAGTAGTTA CTGGAAAGAG TAAACGTTCT ATTGAAGATT 180 

ATTTTGATTC AACTTTTGAA TTGGAATATA GTCTTAGAAA ACAAGGAAAG ATGGAACTTC 240 

TTAAGTCAGT TAATGAATCG ACTGATATTA AAGTACATTT CGTTCGTCAA AGTTCACCAC 300 

GTGGTCTTGG TGACGCTGTT CTCCAAGCGA AGTCTTTTGT TGGTGACGAT CCCTTTGTTG 360 

TAATGCTTGG GTGATGACCT TATGGATATC ACCGACTCAA CTGCTGTACC TTTAACAAGA 420 

CAATTGATGG ATGATTACAA CGCAACACAG GCTTCAACTA TCGCAGTAAT GCCTGTTAGA 480 

TATGAAGAAG TTCTCCTAAT GGTGTGATTC TCCCCAGAAT TGGAAAGTTA GTAATGGCCT 540 

CTATAGTGTT GATGCCTTTG GTTGAGAACC AAACCAGAAG AAGCGCCTAG CAATTAG 597 
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(2) INFORMATION FOR SEQ ID NO: 180: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 836 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DMA 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO:180: 

CTCCCTAGTG TGGTAGATTA GGCTATTTTT TGGTATAATA AAAGTTATGG AAATCGAAAA 
AACCAATCGT ATGGAATGCG CTCTTTGAAT TTTATGCGGG CGCTTTTGAC AGATAAGCAA 
ATGAATTATA TAGAGCTTTA CTACGCTGAT GATTACAGTC TTGCTGAGAT AGCTGAGGAG 
TTTGGTGTTA GTCGTCAGGC TGTCTATGAC AATATCAAGC GAACAGAAAA GATTCTGGAA 
GATTATGAGA TGAAATTGCA CATGTACTCG GACTACATTG TCCGTAGTCA GATTTTTGAC 
CAAATCTTGG ATCGCTATCC CAAGGATGAT TTTCTGCAGG AGCAGATAGA AATTTTAACA 
AGCATTGATA ATAGAGAATA AGAGGAAGAA AAATGGCATT TGAAAGTTTA ACAGAACGTT « v 
TGCAGAACGT CTTTAAAAAT CTACGTAAAA AAGGAAAAAT CTCTGAATCT GATGTCCAAG 480 
AGGCAACCAA AGAAATTCGC TTGGCCTTGC TCGAGGCCGA CGTTGCCTTG CCTGTTGTAA 540 
AGGACTTTAT CAAGAAAGTT CGTGAGCGCG CAGTCGGGCA TGAGGTCATT GATACACTTA 600 
ATCCTGCGCA ACAGATTATT AAAATCGTTG ATGAGGAACT GACAGCCGTT TTAGGTTCTG 660 
ATACGGCAGA AATTATCAAG TCACCTAAGA TTCCAACCAT CATCATGATG GTTGGTTTAC 720 
AAGGGGCTGG TAAAACAACC TTTGCTGGTA AATTGGCCAA CAAACTCAAG AAAGAAGAAA 780 
ATGCTCGTCC TTTGATGATT GCGGCGGATA TTTATCGTCC ACCTGCCATT GACCAG 836 

(2) INFORMATION FOR SEQ ID NO: 181: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 840 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:181: 



CGGGTTTCAA ACTCTTCTCT TGTCTTGGGC AGACTGCGGT TCCGGAAGNC TTCCAGATAA 
CGTTCAATTT CATCTAGCAA ATCAGAAGCA GGATTGGTCT GGCTCAGTTG ACCTGCAATT 
TTTGAAAAGA GTTGCGCTAA GATCAGGCTT TCACTGGCGG CAAGGTGACA AGTGTTAATC 

215 



60 
120 
180 
240 
300 
360 
420 



60 
120 
180 



WO 97/43303 



PCT/US97/D7950 



TGTTGGGCCA TGTTTCTCAG GGATACGACT TTGTCGTTGT CTCATCTCAA AGTAGTGGAT 240 

ATGGTAGTCG GTCTGGTGAA AGAGGTGGTC AGAGTGATCC AAATAGACCA GTCTGAGGGC 300 

TTCTTTCAAA AGCGTGTCTA ATTCTGCTAC CAGCTGTGCT CGGTTGCGTC CGTCTCCTCT 360 

GGATAAATAG TATTTGAAGC GCTGGAGGAT ATCTTTTAAC TTTTCTTCCA CCAGCGTGTG 420 

GTAGTGCTGG ATTTCCTCTT CTCGTGAAGG CATATAGAGA TTAACAAGCA AGGCAAATCC 480 

TGTACCAATA GCAAAGAGAA GGAATTCATT GACTAGAAGG TCTGGAGAGG TTGACTCTTG 540 

AACCAAGAGA TGGCTAACCA AAACAGTGCT TGGTGTGATG CCAATTTCCC AGCCCATCTT 600 

GTAGGCTAAA GGAACGTAGA AGGCCAGATA GAGGCCGAGA CTCCAGATAT GAAATCCGCT 660 

CAAGTGAAAA GCTAGAACAC CCGATAGCCC CGAGACCTAG AAGCATAGAA AAAAGACGAT 720 
TGCGAGCCAG TTTTAAAGTA CTTCTACGCG TATCAGATAG GCTCAAGAGA GCGATCATTC 



780 



CAGCCGAAAC TGCTGATGAA AGATTGAGAA AATAAGCAAG CAGGCAGGCA AGACAGGTAG 840 
(2) INFORMATION FOR SEQ ID NO: 182: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1168 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 182: 



GGATGGAATT GTTAGCAAAT GCCAACAAAA TCATGGAAGG AAAAGTTGTG CTAAATTGGG 60 

ATATCCAATA CCTTGGCTGG GGTGACTATG GTAAGGAAAA TGTCAGTTAT CAACATCATC 120 

TGGTGAAAAC TATGATATCG CCTTTGCAGA TAACTATATT GTAAATGCTC AAAAAGGTGC 180 

TTATGCTGAC TTGACAGAAT TGTACAAAAA AGAAGGTAAA GACCTTTACA AGGCACTTGA 240 

CCCAGCTTAT ATCAAGGGTA ATACTGTAAA TGGTAAGATT TATGCTGTAC CAGTTGCAGC 300 

CAACGTTGCG TCATCTCAAA ACTTTGCCTT CAACGGAACT CTCCTTGCTA AATATGGTAT 360 

CGATATTTCA GGTGTTACTT CTTACGAAAC TCTTGAGCCA GTCTTGAAAC AAATTAAAGA 420 

AAAAGCTCCA GACGTAGTAC CATTTGCTAT TGGTAAAGTT TTCATCCCAT CTGATAACTT 480 

TGACTACCCA GTAGCAAACG GTCTTCCATT CGTTATCGAC CTTGAAGGCG ATACTACTAA 540 

AGTTGTAAAC CGTTACGAAG TGCCTCGTTT CAAAGAACAC TTGAAGACTC TTCACAAATT 600 

CTATGAAGCT GGCTACATTC CAAAAGACGT CGCAACAAGC GATACTTCCT TTGACCTTCA 660 

ACAAGATACT TGGTTCGTTC GTGAAGAAAC AGTACGACCA GCTGACTACG GTAACAGCTT 720 

GCTTTCACGT GTTGCCAACA AAGATATCCA AATCAAACCA ATTACTAACT TCATCAAGAA 780 

AAACCAAACA ACACAAGTTG CTAACTTTGT CATCTCAAAC AACTCTAAGA ACAAAGAAAA 840 

ATCAATGGAA ATCTTGAACC TCTTGAATAC CAACCCAGAA CTCTTGAACG GTCTTGTTTA 900 

CGGTCCAGAA GGCAAGAACT GGGCAAAAAA TTGAAGGTAA AGAAAACCGT GTTCGCGTTC 960 

TTGATGGCTA CAAAGGGAAA CACTCACATG GGTGGATGGN AACACTGGTA ACAACTGGAT 1020 

CCTTTTACAT CCAACGAAAA ACGTTACAGA CCCAACCAAA TCCGAAAAAT TCCTAAAGAA 1080 

AAGAAATTTN GGGCAGAAAG CCTAAAAGAA TCCTCCCCAG CGCCTTTGGA ATTTTATCCT 1140 
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TTTTAAATAC CTTGGAACAA TGTTGAAA 

{2) INFORMATION FOR SEQ ID NO: 18 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 737 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:183: 

ACTTCTGGAG CAAGTGACAC GATCTTCATG AAACCACCTT CACGCAATTC ACGTGCAACT 60 

GGGCCAAAGA TACGTGTTCC GCGAGGAGTT TTGTCTTCAC GGATGATAAC TGCTGCGTTT 120 

TCGTCAAATT TGATGTATGA ACCATCAGCA CGACGAGCAC CTGATTTAGT ACGAACGATA 180 

ACTGCTTTAA CAACGTCACC TTTTTTAACC GCACCACCAG GAGTAGCTTG TTTTACAGAT 240 

GCCACGATAA CATCACCGAT GTTTGCAAAT TTACGTCCTG AACCACCAAG AACTTTGATA 300 

GTCAAGATTT CGCGAGCACC GCTGTTGTCT GCGACTTTCA AACGAGTTTC TGTTTGAATC 360 

ATTTCAGTTT TCTCCTTTCA GGTTTGATTA GATGATGACC GCTTCTTCAA CAACTTCTAC 420 

AAGACGGAAA CGTTTTGTAG CTGAAAGCGG GCGAGTTTCC ATGATACGTA CGATATCACC 480 

TTCTTTGGCA ACATTGTTTT CATCATGAGC TTTGTATTTT TTAGAGTAGT TAATACGTTT 540 

ACCATAGACT GGGTGGTTAC GTTTTGTTTC AACTACAACT GTGATTGTCT TGTCCATTTT 600 

GTCAGATACA ACACGTCCAA CAAGAACTTT ACGATTATTG CGTTCCATTG AAATTTCTCC 660 

TTCCCTAGTC TATTATTTCG CTTCAGATTG AACTGTTTTG ATACGAGCGA TTTGTTTTTT 720 

AACTTCTTTC AAGCGAG 73? 

(2) INFORMATION FOR SEQ ID NO: 184: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 740 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:184: 

CAAGCAATTG GCGCAGTTGT TCATTTTCTG TCTTTAAACT TTCCACCTCA TTTGATTTAA 
CTTCTAATTG GTAAAGCTGT TTCTTCAAAC TTTCATTTTC ATTATATGTT CGTGTCAAAT 
GAGCCAAATC TGATTTGACA GAATCAAACC ACTGAAAAGG TTTTTGTACA ACTCTATCAA 
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CCAATGAGAT TCCATCTCCT AATTTTGTCA CAATTGTACT TGAATAAGTC GTCGCTAAGA 
GAGCTGACAC AAGCAGAACA GTGACAAAAA CAATAATGAC ATATTTTGAT TTTTTAAAAC 
GGTTCATATC CCTACCTTTA TATCAAAAAC TGTTACAGTA ACTTTTTATC AATTCCTGAA 
AGCTACTAAG ATTTTAAGAA AAATAAACAA CAACCAAGTA CGAGAATAAC AAGAATGGTC 
AGCGTATCCT TTAGAGTCCA TTTCAATTGT CGGTATTGAC TTCTGCCTTT TCCACCCTGA 
TAGCCACGCG CTTCCATAGC GATAGCCAAG GAATCTGCAC GTTTTAAACT TCTCGCAAAA 
AGAAGAATCA AAATGGGAAT CATCGCCTTT ACTTTTTGAA CGATGCTTCC TTCTCCAAAA 
TCCACTCCAC GAGGCTTTCT GTGCATTCAA AATCCGAGTC GTATCATCCA TCAAGGTTCG 
GACAATACGC AAACTCATAG ACAGCATCAA TCCAATTTCA TGAACTGGAA CTTTCACACG 
CTTTAAAGGT GCTAATAAAG 

(2) INFORMATION FOR SEQ ID NO: 185 : 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 567 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 185: 

ACCTTTGATA AAACAATCGA TTTACCCTTT GAGAATTTAA GCCTTCCTGC ACCTGAAAAA 60 

TTTGATACTA TTTTGACACA ATTTTATGGA GATTATATGA CCCTACCACC AGAAGAAAAA 120 

CGCTTCTACA GTCATGAATT TCACGCTTAT AAATTGGAGG ATTAGGATGC AATATTTAGA 180 

AAAAAAAGAA ATTAAAGAAA TTCAACTAGC CCTGCTGGAC TATATTGATG AGACTTGTAA 240 

GAAACATGAT ATTCCTTATT TTCTCAGTTA TGGAACCATG CTTGGAGCCA TCCGCCACAA 300 

AGGTATGATT CCTTGGGATG ATGATATTGA TATTTCCCTT TATCGTGAGG ATTATGAGCG 360 

TTTACTGAAG ATTATTGAAG AAGAAAATCA CCCTCGCTAC AAGGTTCTTT CCTACGATAC 420 

ATCTTCTTGG TACTTCCATA ATTTCGCATC GATTTTGGAC ACTTCTACTG TTATAGAAGA 480 

ACATGTTAAG TACAAGCGTC ATGATACCAG CCTTTTCATC GATGTCTCCC ATTGATCGAT 540 
TTACAGACTT GAACATTGTC GACAGAG 

567 

(2) INFORMATION FOR SEQ ID NO: 186: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 564 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
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(xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO:186: 




CCTTGCGTGT 


AGCTTACTTC 


AAGGTTCAm 


ATCCTATTTA TTACTACTGT GCTTACTTCT 


60 


CCATTCGTGC 


TAAGGCTTTT 




CCATGGGTGC GGGCTTGGAG GTCATCAAGC 


120 


GCAGAATGGA 


AGAAATCTCT 


unnnnnUVaun 


AGAACAATGA AGCCTCTAAT GTGGAAATCG 


180 


ATCTCTATAC 


AACTCTTGAG 


ATTGTCAATG 


AGATGTGGGA ACGAGGTTTC AAGTTTGGTA 


240 


AATTAGATCT 


CTACTGTAGT 


CAGACGACAG 


AGTTCCTCAT CGACGGGGAT ACCCTTATCC 


300 


CACCATTTGT 


AGCAATGGAT 


GGTCTGGGGG 


AGAACGTTGC CAAGCAACTG GTGCGGGCGC 


360 


GTGAAGAGGG 


AGAATTCCTC 


TCTAAAACAG 


AACTACGCAA GCGTGGTGGA CTCTCATCAA 


420 


CCTTGGTTGA AAAGATGGAT 


GAGATGGGTA 


TTCTTGGAAA TATGCCAGAG GATAACCAGT 


480 


TGAGTTTGTT 


TGATGAGTTG 


TTTTAAAAAA 


TTGCTTAATA ATCTATTAAA AGAAGCTAAC 


540 


GTATATCCAA 


TAGATTTACA 


TTAG 




564 



(2) INFORMATION FOR SEQ ID NO: 187 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 564 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 187: 



CGAAGTTATC 


TTGGTGGATG 


ATGTGCTCTA 


TACAGGTCGT 


ACCATCCGTG 


CTGCTATTGA 


60 


TAATATTGTC 


GGTCATGGTC 


GTCCTGCGCG 


TGTGAGTTTA 


GCAGTTCTAG 


TCGATCGTGG 


120 


ACATAGAGAA 


TTGCCAATCC 


GTCCAGATTA 


CGTTGGAAAA 


AATATCCCAA 


CCAGTCGTTC 


180 


TGAAGAAATC 


ATCGTAGAGA 


TGACAGAACT 


TGATGACCAA 


GACAGAGTTC 


TGATTACTGA 


240 


AGAAGCTTAG 


AAAGTTAAAG 


GAGTAGCCAT 


GTCAGAAAAT 


CAACAAGCAT 


TGAACCATGT 


300 


GGTGTCCATG 


GAAGACCTCA 


CTGTTGATCA 


AGTAATGAAA 


TTGATCAAGC 


GAGGAATTGA 


360 


GTTTAAAAAT 


GGAGCCCAGA 


TTCCCTATGA 


AGACCATCCG 


ATTGTTTCCA 


ATCTCTTCTT 


420 


TGAGGATTCT 


ACACGGACAC 


ATAAGTCCTT 


TGAAGTCGCA GAGATTAAAC TTGGATTGGA 


480 


ACGACTTGAC TTTGATGTGA AAACTAGTTC 


GGTTAATAAG 


GGGTGAGACA 


CTTTATGACA 


540 


CCATTTTGAC 


TCTGTCTGCT 


TTAG 








564 



(2) INFORMATION FOR SEQ ID NO: 188: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 727 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 188 : 

TTTAAGTTCC AATTGTAAAC TCACGTTCCT GAACCTGGTT CTCCATTTTC AGAGATTTCC 60 

GGTGCAGGAT TTTTTGGTGT CGCGAAACGA ATATTTCCAC GTCCACCACG ACCACCGTGG 120 

GCAACGATAA ATTCTTGCCC ATGTTCAATC AAATCTGTTA AAACCTTGCC AGTCTCCGCA 180 

TCACGAACAG TCGTACCTTG TGGTACTCGA ACTCTAAGGT CCTCAGCACC ACGACCATGC 240 

ATCCCTTTGG TCATCCCTTT TTCACCAGAA TCAGCCTTGA AATGACGATT GTAGCGGAAA 300 

TCCATCAAGG TACGTAGTCC TTCGTCTACA ACGAAGACCA CATTGCCTCC ACGACCACCA 360 

TCACCACCCC AAGGGCCTCC ATTAGGGACA TATTTTTCAC GACGAAAGGC AACCATACCA 420 

TCGCCACCAT TACCAGCCTT GACCTTAATC TTAGCTGTAT CTAAAAACAT ACTCATTATT 480 

TCTTCTCACT TTAAAAAAGG GCTGGGAAAT CCCAGTCACT AAATTTTCTT GAATCTATTT 540 

TATAGATTAC TGAGGGCACC AATTGCAGTT GCAAAAATTC CCAATAAACT TGCTACTAGC 600 

ATGATAATCA CGATAAACAA GGTTATTTTC TCAAACATAG TTTTTTTACG ATTTCCATTA 660 

TCTCCAAATT GCCATTTTTC TCTCCTTCCG TTACATTCTA TTTTCTATTA TCTTAGCATG 720 

AATTGAG 72? 

(2) INFORMATION FOR SEQ ID NO: 189 : 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1071 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:189: 

CTGTGATAAA GGCAAATGGA AGGAACTAGA TAAACTGGGA CTTAAAACCA GTAGCAATCA 60 

TGTAAGATGA GTGTTCCAAA CAACTGGAAG CAAAGGAGGG CAACAGATAC AGGCATTCCC 120 

AAAATCAATG GTACCAAGAT GGTCGCACCA AACATGGCGA AAACGTGTTG GAAGCTAAGG 180 

AGAATTCCTT TTCCA7CCGA AGGACGTTGG TCAACGTCTA GTAACAAATC AACAGTTGAT 240 

TCTTGTTTCA TATAAACCTC ACTTTTGGGC ATCAGAAAAG AGCCCATTAT TTTACAGCAA 300 

TAGGCCCTCA GAGTAAGACT TCTCTAAAAT CTAGACTTTA AAGAGTTTCA AATATTTTTG 360 

CGTCTTCGTC ACCTCACGGG ATGACATTAA AAACCTTTGC TTAGGATAGT ATAGCAGAAA 420 

AAAATGATTT TGTAAATCAT TTTTTCCCGA GCCTAGAAAT AAAAGAGCGA GGCTGATTTT 480 

GTAAACATTT GTTTACTAAA AAATTTAAAG TGGGCGTTTA TTTGGCATAC CAGCCTTACG 540 

TGGATATTTA TTTGGTGTTT CTTTTTTCTT TTCTACCACT GTGATATAGC GCGGATCTCT 600 

ATTCGGTAGG GCGTACTGAG ATTGTCTTCG ACCTTACTAA AAAGGAGATT GAGGGCATTC 660 
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TTAGCTTCTA ATAATTCCTC AGGCGCATTG 
ACCTTAAGGT AGGGAATAGT CAATTCAGAT 
GTTACAAAAT CATATTGAGC ACGGAAGTTC 
TGGTAGAAAT GAACTCCGTT CAAATCCAGT 
CGCTTATTGA GTGAATCAAT AATGGTCACA 
AGACTAGGAA ATCCTGCCCC AGCCCCGATA 
AAACCTTGAA GAATGGGTGC AATCGAATCG 

(2) INFORMATION FOR SEQ 
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ATGGCCTTGA 


GTGCCAATAG 


TTTGCCACCA 


770 

/ £, \J 


AGGACCTGCA 


TACGGGCAAC 


CGCACGAGCT 




TTGTCTTGGG 


CAAAATCTTC 


GGCACGTCCG 


840 


TCTTGAGCCA 


AGAGTTGTAG 


GAAGTTGATG 


900 


TCTAACTCCG 


GATAGAGAAT 


TTTCATTGGT 


960 


TCAAGAAGTT 


TGATAGTTTC ATTGGGAATC 


1020 


TAAAAATGTT 


TGAGATAAAC 


T 


1071 



ID NO: 190: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 827 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:190: 

GGAACGTCGC TATCAAATAG AAGCACAACA TATTGAGCGA ATTTTTAGCC ATTTTGACGC 60 

CGATATGGCT ATATTCCCTG AACTAGCTAC CAATATTCAG AGGTGAGCAA GAAAACCAGA 120 

GAATCAAACT ATTGTTTTCA TCAAGTTGGA CTTTTCTATG GCCAACTATG ATATTTTCAC 180 

TTCTCCACCT ACCAATAGTG GAATAGCTCC TGTGACTGTG ATTGTCAAGA AAAGTTATGG 240 

TTTCTATACA GAAGCTAAAA CTTTTCATAC AACACGGTTC GGGACAATTG TATTACATTC 300 

GAGAAAACAA AATATACCAG ATATCATTGC CTTGCATACT GCTCCTCCTC TGCCAGGTTT 360 

AATGGAAATC TGGAAGCAAG ACTTAAACAT CATTCATAAT CAATTGGCTT CAAAATATCC 420 

AAAGGCTATT ATTGCAGGTG ATTTTAATGC AACTATGCGT CATGGAGCAC TTGCAAAAAT 480 

AAGCTCTCAT AGGGACGCAT TAAATGTACT GCCACCTTTT GAAAGAGGAA CTTGGAATAG 540 

CCAAAGTCCA AAACTTTTTA ATGCAACAAT AGATCATATT TTATTGCCTA AAAACCACTA 600 

CTATGTTAAA GATTTAGACA TTGTAAGTTT TCAAAACTCT GATCATAGAT GTATTTTTAC 660 

AGAAATCACA TTTTAATTAT TTTATATAAA ATCACCCCTC TAGTGTTCAT AAACTAGAGG 720 

GGGAATTTGT ATCCTACTAT CGTTTAACGC ACTTCTGCAT TGACTTTTTC TTCCGAGAGA 780 

CGCTTGGATT TTTTCCATGT AGCGTGCGAC TTCCTNCGTC CGTTAAG 8 27 

(2) INFORMATION FOR SSQ ID NO: 191: 



<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 807 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 191 : 

CAGGGGATGC CAGTTGATTT GCTTCAAGTT GACTTGACTC GTACTTGGGA AATCCTCGGA 
GAAATCACTG GGGATGCTGC TCCAGATGAA CTCATCACCC AACTCTTTAG CCAATTCTGT 
TTAGGAAAAT AAGAAAAATC CATGATCCTT CATTCGGTCA TGGATTTTAT TGTCTTTATT 
AGTAATCTGG TCTTAAGACC CCTGTTACAG TTGCCTTAGT TGCTTCGTAG TCGCCATCTA 
CGACAACCTT GATAATGCGT TTGACATCTT CTTCTGGTGC TGGAACAAGA GGTAGACGAG 
TGGGTCCAGC TTCAAATCCC ATATAGTTAA GAATTGCCTT AACTGGAGCA GGACTTGGAT 
AAGAGAAGAG AGCATTAACC TTAGGAATGA ATTTACGCTG AATTGCTGCG GCTTTCTTCA 
TATCGCTTTC TGCAATGGCA GTAAACATCT CGTGCATTTC ATCCCCATTT GTATGAGAGG 
CAACAGAAAT AACCCCATCC GCCCCAAGGT TCATGGCATG GAAAGCATCT CCATCCTCAC 
CTGTATAAAT CAAGAACTCT TCAGGCTTGT GCTCAATCAA GTTAAGCCAT ATTAGCCAAG 
CTAGTTACAT TCCTTTGACA CCCGATAATA TTTGGATGGT CCAGCCCAAG CGAAGCATGG 
TTTCCTGGAG TTCAATTCGA CAACTACACG CCCCTGGAAT GTTATAGATA ATAATTGGTA 
GGTCAGAAGC ATCTGCAATA GCCTTAAAGT GCTGATACAT CCCTTCTTGA GAAGGTTTGT 
TGTAGTAAGG AACAATAGCA AGCCCAG 

(2) INFORMATION FOR SEQ ID NO: 192: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1191 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 192: 

CGTGCCTTGG CCAATGATCC AAAAATCTTG ATTTCAGACG AGTCGCTTCA AATTTCGGCC 60 
CCTGGACCCT TAAGACCAAC CCAAGCAGAT TTTGGCCCTT GGTTGCAAGA TTTGAACCAA 120 
AAATTAGGCT TGACTGTTGT CCTGATTACG CATGAAATGC AGATTGTCAA AGACATTGCC 180 
AACCGTGTTG CAGTTATGCA GGATGGGCAT TTGATTGAAG AGAGTAGTGT GCTTGAAATC 240 
TTCTCAGACC CTAAACAACC TTTGACTCAA GACTTTATCT CAACAGCTAC AGGTATTGAC 300 
GAAGCCATGG TCAAAATCGA GAAGCAAGAA ATCGTGGAAC ACTTGTCTGA AAACAGTCTC 360 
TTGGTGCAAC TCAAGTACGC TGGATCTTCA ACAGACGAGC CACTTTTGAA TGAATTGTAC 420 
AAGCATTATC AAGTAATGGC TAATATTCTC TATGGGAATA TCGAAATCCT CGATGGTACT 
CCTGTTGGAG AATTGGTGGT GGTCTTGTCA GGTGAAAAAG CAGCGCTGGC AGGTGCTCAA 
GAAGCCATTC GTCAAGCAGG CGTACAGTTA AAAGTATTGA AGGGAGGACA GTAAGATGGA 
ATCATTGATT CAAACCTATT TACCAAATGT CTATAAGATG GGTTGGTCTG GTCAGGCAGG 
CTGGGGAACA GCTATCTACC TAACCCTCTA TATGACAGTT CTTTCCTTCA TTATCGGAGG 
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CTTCTTGGGG CTAGTGGCAG GTCTCTTTCT 
GAATAAAGTC GTATTCTGGA TTTTAGACAA 
TATCATCCTC TTGGCAATCT TGTCACCACT 
GCCAAATGCA AGCCCTTGTC CCACTTTCTT 
TGCAGGTTGT CTTGGCTGAA TTGGATGGCG 
CGACTTTCTG GGACATCGTG GGTGTTTACC 
TGACGACTGT GACCTTGATT TCCCTTGTTG 
GCGCCGGGAA AAGCGGTTTG CGTTATTGGG 

(2) INFORMATION FOR SEQ 
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CGTCTTGACA GCGCCAGGTG GTGTCTTGGA 780 

AATTACCTCA ATTTTTCGTG CGGTTCCCTT 840 

TTCTCACTTG ATTGAAAAAA CAAGTATCGG 900 

TTGCAGTCTT TGCCTTCTTT GCCCGTCAGG 960 

GTGTCATTGA GGCGGCTCAA GCGAGCGGAG 1020 

TATCAGAAGG TCTTCCAGAT TTGATCCGTG 1080 

GGGAAACAGC TATGGCCGGT GCGGTTGGAG 1140 

CGCTCTTCCG CTTCCTCGCT C 1191 

ID NO:193: 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 682 base pairs 

(B) TYPE: nucleic acid 

(C) STRANOEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 193: 

TTCAATGGGC CCTGTAAGGC TGCCAAAAAG AGAAACCAAT TAAAACCGCG CCAGCCGCCA 60 
AAATCCTTGG GCATTTTTAG CCAGCATGGG AAAAGTGATA GTGACTGGCC AAATCCACCA 120 
CATTTTCGAT AGCAGAATTG ATAATTTCAA AGGCTACCAC CAAGAAAATG CTCAATAGGA 180 
GAAAGAGCCA TTCGATTCGT GACACCTGAA AACCAAAACC TGCAAGGATG ACCACTAGAG 240 
CCGTCACTGC ATGTTTTCGC ATATTGCGTT CTTCCTTGAT AGCAGTAAAA ATCCCTGTGA 300 
TGGCAAATTC TAAACTGGAT ATCAGGTCAC GATTTTTCCA TTTTCGTTTA TTGTCTTGTG 360 
AGTCCATAGG CTGTCAAAAT TTCTTCTTGT AAACCGAACA TCTCCGCTTC TTCTTCCGGA 420 
GTATAGTGAT CATAGCCGTT AATATGTAAA AAGCCGTGTA CTGCCAAGAA GCCCATCTCA 
CGCTCAAAGC TGTGACCATA TTCTTCGGCC TGCTCATGAG CCTTATCGAT AGAGATGAAC 
AATTCCCCAA TATAGGCATC AAACTCAGAC ATCATCTCTG CCAATTCTGG ATTTTCAAGC 
AAATCCTCTT CGTCAAAGGC AATTTCCAAT TCTGGTTTAT ACTCAAGGCT GATGACATCT 
GTCGGACGGT CGGTGTCACG GT 



480 
540 
600 
660 
682 



(2) INFORMATION FOR SEQ ID NO: 194; 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1094 base pairs 

(B) TYPE: nucleic acid 

(C) STRANOEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:194: 

CTACTTCTGT AAATTCTTTA TAAATGACAT TTCCATTTCT TTCCACAGTC ACCAGCATTT 60 

CTGTTGGATT ATAATCCAAA TTAAATAATT CTTCAAAAAA CTTTATCTTT TCATGATAAT 120 

CTGCAATGTC AGAAAGATTA TTTTTATTAA CACTCATAAC ACTAGCCCCT TATTTTTCAG 180 

TAAACTCTCC GTCTACGACG TCATCGCCTG CGTTTCCTGT TGCTTGTGCG CCTTCTGCTC 240 

CTTCTTGAGC TTGTTGCGCT GCTGCGGCTT GTTCGTAAAG TTTCACAGCA AGGCCTTGAG 300 

CTTTTTCGTT CAACGCTTCA AGTTTCGCTT TCATGTCGTC CAAGTTGTTG TCTTCTTGAG 360 

CTTTCTTAAG GTCATCAAGG GCAGCTTGGG CAGCGTCACG TTCTGCGTCG AAGCCTTTAC 420 

CTTCAGTTTC CTTGATTGTC TTTTCAGTCG CAAAGATTGC TTGGTCTACT TCATTACGAA 480 

GGTCTACTTC TTCTTTACGT TTCTTATCAG CTTCAGCGTT TGCTTCGGCA TCTTTCATCA 540 

TGCGGTCGAT TTCTTCGTCA GTCAAACCTG AGTTCGATTG GATGACAATA GTTTGTTCTT 600 

TTTGAGTTCC AAGGTCTTTG GCCTTAACAG ACACGATACC GTTCTTGTCG ATGTCAAATG 660 

TTACTTCGAT TTGAGGAATT CCACGAGGTG CAGCTGGGAT ATCAGTCAAT TGGAAGCGTC 720 

CAAGAGTCTT GTTATCTGCT GCCATTGGGC GTTCACCTTG AAGAACGTGG ATATCAACGG 780 

CTGGTTGGTT GTCTGCTGCT GTTGAGAAGA CTTGTGATTT AGATGTTGGG ATTGTAGTGT 840 

TGCGATCGAT AAGTTTTGTA AATACTCCAC CCATTGTTTC GATACCAAGT GACAATGGCG 900 

TTACATCAAG AAGGACAACG TCCTTGACAT CACCAGTAAT CACACCACCT TGGATAGCCG 960 

CACCCATAGC AACTACTTCA TCAGGGTTTA CTGATTTGTT TGGTTCTTTA CCAGTTTCAG 1020 

CTTTAACAGC TTCAACAACG GCAGGGATAC GAGTTGAACC ACCAACAAGG ATAACTTCGT 1080 

CGATTTCTGA CAAG 1Q94 

(2) INFORMATION FOR SEQ ID NO: 195: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 793 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 195: 

CTCTGGGTTC CATTCTTGTG GCTGTAATAC TAGGAATTTT TCTAGGATTT TTCTGGCAAG 60 

AACCGATTCA GAAAGAAAAT CGTCTGGCTT GTCATGAGCA TGATTTTTCT TAACTTGAGT 120 

TCTGCAAAAA AAGTTTTTCA AGTCTTTGTG CAGGCCATTG ATGAATTTTT TGATACGGGG 180 

CGTTATTTGG TATTTGGCTG CCTCTTTGCT TCTATAATAC AGGTCTACGT TCCGACTCGG 240 

ATTCTGACCT CTATCAGTGC GACCCCTCTT TTTGCCATCC TGCTCTTGAT GATTTTAGCC 300 

TTTCTTCTTT CGCTCTGTAG TGAGGCGGAT GCCTTTATAG GTGCTTCTCT TCTCTCGAGT 360 

TTCGGTTTGG CACCAGTTCT GGCCTTTCTC GTCATTGGTC CAATGCTGGA TATCAAAAAT 420 

ATTCTCATGA TGAAAAATTA CTTGAAAGCA CGATTTATCA GTCACTTCAT AACAATTGTA 480 
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ACTCTTGTCG TCTTAGTCTA TTCTCTCTTG 
TTTTAGCTGG CTATTTTGAA CTGACTATTT 
ACATCAACAT GCACTATTCC TATCTGGCCT 
CTATCGTTCA ATTGTATATC TGGATGAAGC 
GATTAGCCAA GATGACGAGT ATTTCTCTTC 
TCCCAACTGT TAG 

(2) INFORMATION FOR SEQ 




PCT/US97/D7950 

ATTGGAGTCA TCCTATGATT CGATTTTTAG 540 

ACCTCCATCT GTCGGGCAAA CTAAACCAGT 600 

ATATCTCCAT GGTGCTTTCT TTTATCTTGG 660 

AAGTCAAAAC CCACAGTCAT CTGAACAGCC 720 

TGGCTATTCC ACTTGTCATC GGCTTAACTT 780 

793 

ID NO: 196: 



(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 894 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:196: 

TTCTTCCAAT CCAGGAAGTN AACTTGTTCC AACCTTCTGC TAGTTTCCCA GAATCCATTC 
TGCCNAATTT ATTAGCTCTA TTCTCTAATG GAGAAGCACC TGAAAGTAAC TGACTGGATT 
TGTCTGCTAA TTGCCCAGAT CCTGATTGTA ATTAACCAAC TCCTGCTGTC AATCTAGAAG 
ATTTTTGTGT CAAGGTGTTT GAGCCTGAAA CTAGTTGATC CAAACTACCT GTCAAGGTGG 
CATTTTTTTC ACTTAGTTGA CTTGCGCCCT GAGAAACTTT ATCAACACCT GTAGTATATG 
CGTTTACACC TGATGCAATC GACTGACTGG CAGGAACTAA TTTGCTAGTA ACATCTCCTT 
GTATCTCTGT TAATCCACTT GACAATCCTA TCAAAGAAGT AGAAGTAATA GGTGATACTT 
GATTAGATTG ATTTTTTAAA GTCGAAAGAT TAGAAGATTG ATTTTGTAAG TTTTCTAAAC 
TTCCCTGTAA ATCTTGTACT AAAGCGACAA TTGATTGAGC CGATTGAATA CTATCAGTTG 
AATTTTGAGA TACAGAATCA CTTATCTCAG TTTGTTGCTC ACTTGTCAAT GATTGATAAG 
CTGCTGTCGA TTGAATATTG GATAATGTAG TTGTTTTTTC AGATAAATCA CTTGGTAAAG 
TCACTACCAT ATAGTAATCG CCATCTTCCA ATCCCTTCTT TCCTTCCTCT TCATCTACAA 
AATGAAAATC CAAGGTTTTA TTTTCTTTTA AATTGGACAC CATGTCTTTT CCTATTGCCA 
TAGTATTACC ATTATAGGAA GCCTCTTTAT CATTATTTAC AACTGCCACA GGTAAGTCAG 
ACAATTGCCC ATATGGCTCC CACATTGATG ACAAAAATAT GATATTGTAC AGAG 

(2) INFORMATION FOR SEQ ID NO: 197: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 627 base pairs 
<B) TYPE: nucleic acid 
(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 197: 

CTATGGCAGA TCTAGACCAT CCTCATATCG TTCGGATAAC AGATATTGGT GAGGAAGACG 
GTCAACAGTA TCTTGCAATG GAGTATGTTG CTGGACTAGA CCTCAAACGC TATATCAAGG 
AACATTATCC TCTTTCTAAT GAAGAAGCCG TCCGTATCAT GGGACAAATT CTCTTGGCTA 
TGCGCTTGGC CCATACTCGA GGAATTGTTC ACAGGGACTT GAAACCTCAA AATATCCTTT 
TGACACCAGA TGGGACTGCC AAGGTCACAG ACTTTGGGAT TGCTGTAGCC TTTGCAGAGA 
CAAGTCTGAC CCAGACTAAC TCCGATGTTT GGGCTCAGTT CATTACTTGT CACCAGAGCA 
GGCGCGTGGT TCTAAGGCGA CTGTGCAGAG TGATATCTAT GCCATGGGGA TTATTTTCTA 
TGAGATGTTG ACAGGCCATA TCCCTTATGA CGGGGATAGC GCGGTGACCA TTGCCCTCCA «0 
GCATTTCCAG AAACCCCTGC CGTCCGTTAT TGCAGAAAAT CCATCTGTAC CTCAGGCTTT 540 
AGAAAATGTT ATTATCAAGG CAACTGCTAA AAAGTTGACC AATCGCTACC GCTCGGTTTC 600 
AGAGATGTAT GTGGACTTGT CTAGTAG 

627 

(2) INFORMATION FOR SEQ ID NO: 198 : 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 729 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 198: 



60 



CTATGCTAGC GACAGCTATT CTTGGTGTCG TGATTGAGTT TCTTGCTTAC CGACCTTTGC 

GCCACTCTAC TCGTATTGCT GTTTTGATTA CGGCTATTGG GGTTTCTTTC CTATTGGAGT 120 

ATGGAATGGT CTATCTGGTT GGTGCCAATA CCCGTGCCTT CCCTCAAGCG ATTCAAACAG 180 

TTCGATATGA TTTGGGACCA ATTAGCTTAA CAAATGTGCA GTTAATGATT TTCGCCATTT 240 

CCTTGATTTT GATGATTTTG TTACAAGTCA TTGTCCAAAA GACTAAGATG GGGAAAGCCA 300 

TGCGTGCAGT ATCAGTAGAT AGCGACGCGG CACAATTGAT GGGGATCAAT GTAAACCGTA 360 

CGATTAGCTT TACCTTCGCT TTGGGTTCTG CTCTTGCGGG TGCGGCTGGT GTTCTGATTG 420 

CTCTTTATTA TAACTCTCTT GA3CCTTTGA TGGGGGTTAC TCCAGGTCTT AAATCTTTCG 480 

TTGCCGCAGT ACTTGGTGGT ATCGGAATTA TTCCTGGTGC GGCTCTTGGT GGCTTTGTGA 540 

TTGGTCTATT GGAAACCTTT GCGACAGCCT TTGGGATGTC AGATTTCCGT GATGCCATTG 600 

TTTATGGAAT CTTGTTGTTG ATCTTGATTG TCCGCCCAGC TGGTATCCTT GGTAAGAATG 660 

TGAAAGAGAA GGTGTAAACG ATGAAGGAAA ATTTAAAAGT TAATATTCTA TGGTTACTCC 720 
TTTTGTTAA 



729 



(2) INFORMATION FOR SEQ ID NO: 199 : 
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U) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 557 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 199: 

CCACAGATTN GACCCAAACA GTATGGTCTC CGATATGTTC AGAAATTGTT ANGATNGCGA 
TTGGNAAGAT AGCGATGGTT CTGGACCAAA GTAAAGATTG TACTCTTTAA AGGCACCACC 
TGTGCTAAAT GGCAAGTAGA AACCAGGAAT TTCGAACCAG TTGGCTTTAA GAACTGGTGT 
AAAGTCAACC AAGCCAAGAG TTAGTGCGAA AAGGTAACCA CCGATAATGG CAAAGAGGAA 
TGGAATGATT CGTAGGAAGC CTTTTCCTTT TGTATTGATA AAGGCAGCAA TTAGGAAAGT 
AACAACGGCT ACCAGAGCAT TTTTCCAATT TCCGTCTGCT ACAAGACCTG CATTGGTAAC 
AGCTGAACCT GCAAGTCCAA GACCGATAAC GATGATCATA GGACCGATAA TGATTGGTGG 
CAAGAGTTTA TCAATCCATT TTGTTCCTAC AAATCGGATG CTGGTAGCAA CAAGGACATA 
GACCAAACCA GTCAAGATAA CCCCTGTTTG GGCAGCAGAT ACATCCCCCC CCATTTCTTT 
CATAGCCAGT GACATAG 

(2) INFORMATION FOR SEQ ID NO: 200: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 608 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 200: 

CTTAGTATCT AGAAAAGGAG AAATAAAATG GTTAAAGTAT TAGCAGCGTG CGGAAATGGA 
ATGGGTTCAT CAATGGTTAT CAAGATGAAG GTTC-AAAATG CTCTCCGTAA GCTTAATCAA 
ACAGATTTTA CAGTCAATTC ATGGCCAGTG TCGGTGAAGC TAAAGGTTTA GCAGTAGGAT 
ATGACATCGT AATCGCTTCT CTTCATTTGA TTCAAGAATT GGAAGGGCGA ACTAATGGGA 
AGTTAATTGG GCTTGATAAC TTGATGGATG ATAAAGAAAT CACCGAAAAA CTCAGTCAAG 
CACTACAGTA AAAGGTTGGA GGGGGCTGGA CAGAAACTGA GAGTTATCGT TTCTGTCCTT 
CTCCCTCTTT AAATAAAGGA GGCAGATATG AATTTAAAAC AAGCTTTAAT TGACAACGAC 
TCGATCCGAC TAGGTTTAGA GGCTAACGAT TGGAAAGAAG CAGTCAAGGT AGCAGTAGAT 
CCCTTGATTG AAAGTGGGGC AATTTTGCCA GAATTATTAC CAAGCTATCA TTGAATCGAC 
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TGAAGAGAAT TATGGGCCTT ACTATATCTT GATGCCAGGT ATGGCTATGC CCCACGCTAG 600 
ACCTGAAG 

608 

(2) INFORMATION FOR SEQ ID NO: 201: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 686 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 01: 

CTGATTTCAC TTGGGTTAGA GCATAGCCTA CCGCTCTCTA GCCACCTCAA TATCAGCATT 60 

GGTCCCTTAA TTCAAACCTG GCGTATCGGT TTTTCAGATG CCAAGGTCGC CCAGCCTCAA 120 

AAAATTGAAT CGGTGCTGCC TCTAATCAAT CCTCATGGTA TCGAGTTAGA TTCTTCTACT 180 

TCTACTGTGT TTTTAAAACA GAAAGGAATG AAGATCGATC TAGGTTGTTT AGCCAAAGGA 240 

TACAGTGCGG ATAAGGTTGC CCAATTTCTT AGGAAAGAGG GGGTGACTTC TGCCTTGATC 300 

AATCTGGGAG GGAATATCCT GACCATTGGA AAAAATCAGG CAAGAGGGGA TAACCCATCG 360 

CAAATCGGGG ATTCAAGGAC CCAGCCAATC CTAGGGGAAA TCATTTAATG ACCATCCCTG 420 

CTTGTCAATA AATCTGTCGT GACTTCAGGC ATTTATGAAC GTCACCTGAC CGTCAATGGA 480 

CAAGATTACC ATCACATTTT TGACAGTCAA ACAGGATATC CTGTTGAAAC GGAACTAGCG 540 

AGTCTAACAA TCATCTCTGA TAAATCAGTC GATGGCGAAA TCTGGACGAC TCGACTCTTT 600 

GGAGAAAGAC CGGCTTCTAT CCTCTGGCAA GTCGAAAGTT TGGAGGGCAT CGAAGTCATC 660 

CTCATCGATA AAGAAGGCCA CCTAAG roif 

OOD 

(2) INFORMATION FOR SEQ ID NO: 2 02: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 912 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 202: 

CTCTGGCTAC GCTCAAGACG AACATGGTAA CTTTGACAAG TTCTGGAATG GAACAGTCTT 60 

CCATATGGTA GGAAAAGACA TCCTTCGCTT CCACTCTATC TACTGGCCAA TCCTTCTTAT 120 

GATGTTGGAT GTTAAATTAC CTGATCGTTT GATTGCCCAT GGTTGGTTTG TCATGAAAGA 180 
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CGGAAAAATG 


TCTAAGTCAA 


AAGGGAATGT 


CGTTTACCCT 


GAAATGTTGG 


TAGAGCGTTA 


240 


TGGACTAGAT 


CCACTTCGTT 


ACTACCTCAT 


GCGTAACCTT 


CCAGTTGGTT 


CAGACGGGAA 


300 


CCTTTACTCC 


TGAAGACTAT 


GTCGGTCGTA 


TCAACTATGA 


ATTGGCTAAT 


GACCTTGGGA 


360 


ACCTCCTTAA 


CCGTACGGTT 


TCCATGATTA 


ATAAGTACTT 


TGATGGACAA 


ATCCCTGCCT 


420 


ATGTAGAAGG 


TGTGACTGAA 


TTTGATCATG 


TTCTTGCTGA 


GGTTGCAGAA 


AAATCAATCG 


480 


CAGACTTCCA 


TACACACATG 


GAAGCAGTTG 


ACTATCCACG 


TGCGCTTGAA 


GCAGTCTGGA 




CTCTGATCTC 


TCGTACCAAT AAATACATCG 


ATGAGACTGC 


ACCATGGGTC 


TTGGACAA(2ft 




ATGAAGCTCT 


TCGTGACCAA 


TTGGCAAGTG 


TCATGAGCCA 






OOU 


TAGTTGCTCA 


CTTGATTGAA 


CCATTTATGA 


TGGAAACTAG 


TCGTGCAGTT 


TTGACTCAAC 


720 


TTGGTTTGGA 


AGAAGTTTCT 


AGTCTTGAAA 


ACTTGAAGTT 


TGGCTGACTT 


CCCAGCAGAT 


780 


GTGACTGTAG 


TTGCCAAACG 


AACACCTATC 


TTTCCACGTC 


TAAATATGGA 


AGAAGAAATC 


840 


GCCTATATCA 


AGGAACAAAT 


GGAAGGCAAT 


AAACCAGCAG 


TCGAAAAAGA 


ATGGAATCCG 


900 


GACGAAGTTG 


AG 










912 



(2) INFORMATION FOR SEQ ID NO: 203: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1097 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 203: 



CTATTTTTTA 


TTGCTGCANT 


ACTTTTCTTG 


GCTTGGTACC 


TTCAGCTGGA 


CCAATGACAC 


60 


CTGCCATCTC 


AAGCTCTTCC 


ATGAGACGGG 


TCGCACGGTT 


AAATCCAACT 


GACAAACGAC 


120 


GCTGAATCAT 


GGATGGCACT 


GGCTTTCTGT 


GTTTCGATAA 


CCAAAGACTT 


AGCTTCTTCA 


180 


AAAAGCGGAT 


CACCACCAGC 


ATCTCCATCC 


GAAAATTCTC 


CTTCATTTTC 


AGAAACCTCA 


240 


CCTGGATCAA 


AACTCTCATC 


GTAGTCTGCA 


TCTGCCTGAG 


TCTTGATGAA 


GTTCACAATG 


300 


CGCTCAACAT 


CGTCATCCGA 


GATAAAGGAG 


CCTATGGAGA 


CGTAACTGGA 


TGATTTTCAT 


360 


TAATCGGTTT 


AAAGAGCATG 


TCTCCTCGAC 


CAAGAAGTTT 


TTCTGCTCCA 


TTTACATCCA 


420 


AAATCGTGAC 


GGGAGTCTGT 


TCCTGATGAA 


ACCGCAAATG 


CTACACGAGA 


TGGAACATTG 


480 


GCCTTAATCA 


AACCAGAGAT 


GACATCAACA 


GATGGACGCT 


GAGTTGCAAG 


AATCATGTGG 


540 


ATACCTGCAG 


CACGCGCCTT 


CTCCCC^AGA 


CGGATGATAG 


CATCTTCCAC 


TTCCTTGCTG 


600 


GCCACCATCA 


TGAGGTCAGC 


CAACTCATCC 


ACAATCACGA 


CAATGAATGG 


TAGCGGAATT 


660 


TGCTTGTACT 


CAGACTGGGA 


ATCGAACTCG 


TCTACCTTGG 


CATTAAAACC 


TGCAACAGCC 


720 


CGAACTCCCA 


CCTTGGCAAA 


GAGTTCATAA 


CGGTTTGCCA 


TTTCATCCAC 


AACCTTTTGC 


780 


ACAGCCCTGC 


TGGCTTTGCG 


TGGATTGGTC 


ACCACTGGCA 


ATCTAACAGG 


TGGGGAATAT 


840 


CACTGTAGAA 


CAGATAACTC 


AACCATCTTT 


GGGATCGACC 


ACCCATCCTC 


AGTAAATTTA 


900 


ACTTGATCTG 


GTCTCGCCTT 


CATGAGAATG 


CTANCAATAA 


TGCCGTTAAC 


TGCTACTGAC 


960 


TTCCCTGAAC 


CCGTTGAACC 


TGCAACTAGC 


AAGTGGGGCA 


TTTTAAAAAG 


GTCAAAAGCT 


1020 
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CTTGCGGTTC CATTAACAGC CTTCCCTAAA GGAATTTCCA AGAAATTTTC TGCTTCGTTT 1080 
GCGATTGTTC CATAGTT Hf 1080 

1097 

(2) INFORMATION FOR SEQ ID NO: 2 04: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1083 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 204 : 



CTCTTGCAAC TGGCTTGATG AGAGACACTG CTTACCTATC AATGGGTAGT GTTTCGATGG 
GGATTGGCGG TTCTATTGTA AATCCAGATT TCTTCCAAGA ATACTTAGGA ATGCGAAATG 
AATCGGTAGA TATGACGGAG TTCACGCGCC GTATGGACCG TGGTATTTAT GACCCAGAAG 
AGTTCGAACG TGCTATGGTT TGGGTAAAAG AACATATCAA AGAAGGGGGT TGACCGAAAA 
TTCGTGAGGA TTTGATTCTT TCTAAAGAAG AGAAAGAAAA ACAATGGGAA TTTGTTATTA 
AGATGTTCAT GATTGGAACG TGACTTAATG GTTGGTAACC CAAGACTTGC TGACCTTGGT 
TTTGAGGAAG AAGCAGTTGG TCACCATGCT TTAGTAACTG GTTTCCAAGG TCAACGCCAG 
TGGACAGACC ATTTTCCAAA TGGGGACTTT ATGGAAACTT TCCTCAATAC TCAGTTTGAC 
TGGAATGGTA TTCGAAAACC ATTTGTATTT GCTTGTATTT GCGACAGAGA ATGATTCACT 
AAATGGTGTG TCTATGCTCT TTAATTATCT ATTAACAAAT ACTCCACAAA TCTTTGCTGA 
TGTGCGTACT TATTGGAGTC CAGAGGCTGT TAAACGTGTA ACGGGACATA CTTTAGAGGG 
TTGTGCTGCA GCTGGCTTCT TACATCTAAT CAACTCTGGT TCTTGTACAT TGGATGGTAC 
™l"Z T ACTCGAAATG GCAAACCTCT TATGAAACCA TTCTGGGAGT TGGAAGAAAG 
TTGAAGTTGC AGGCTATGCT TGAAAATACA GACTTCCCAC CAGCAAACCG CGAATACTTC 
CGTGGAGGAG GATTCTCAAC TCGTTTCTTG ACGAAGGGGG ATATGCCAGT AACAATGGTA 
CGTCTCAATC TTCTAAAAGG GGTTGGTCCA GTGCTACAAA TTGCAGAAGG TTATACACTT 
GAACTTCCTG AAGATGTTCA CCATACTTTA GATAATCGTA CAGATCCAGG ATGGCCAACT 

ACT^G CTCCACGTTT GACAGGAAAA GGTGCTTTCA AGTCTGTNTA TGACGTCATG 1 080 

AAT ^ 

1083 

• (2) INFORMATION FOR SEQ ID NO: 205: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 813 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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<ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 205: 



CTCTGTTATT 


AAAGAAGATA 


ACGCTGCTAT 


CCGTGGTATG 


ATCACAGCAG TATCTCACTT 


60 


AGTAACAGTT 


GAAGAAGTAA 


ACTAATGAAG 


TTTTAGGGGA 


TGTGCACTGT ACCATCCCCT 


120 


AAAACTAGAT 


ATAGTCATCT 


ATGATGACAT 


CGTATAGGCG 


AGTTGATGGG GGAGACAACC 


180 


TTTTCTCCCT 


TATCGGCGCT 


AGCATTTTAC 


AAAAGAGGAG 


AAAATAAAAA TGAAACTTCA 


240 


TGAATTGAAA 


CCTGCAGAAG 


GTTCTCGTAA 


AGTACGTAAC 


CGCGTTGGTC GTGGTACTTC 


300 


ATCAGGTAAC 


GGTAAAACAT 


CTGGTCGTGG 


TCAAAAAGGT 


CAAAAAGCTC GTAGCGGTGG 


360 


CGGAGTTCGC 


CTTGGTTTTG 


AAGGTGGACA 


AACTCCATTG 


TTCCGTCGTC TTCCAAAACG 


420 


TGGATTCACT 


AACATCAACG 


CTAAAGAATA 


CGCAATTGTG 


AACCTTGACC AATTGAACGT 


480 


CTTTGAAGAT 


GGTGCTGAAG 


TAACTCCAGT 


TGTTCTTATC 


GAAGCAGGAA TTGTTAAAGC 


540 


TGAAAAGTCA 


GGTATTAAAA 


TTCTTGGTAA 


CGGTGAGTTG ACTAAGAAAT TGACTGTGAA 


600 


AGCAGCTAAA 


TTCTCTAAAT 


CAGCTGAAGA 


AGCTATCACT 


GCTAAAGGTG GTTCAGTAGA 


660 


AGTCATCTAA 


GAGAGGTGAC 


CTATGTTTTT 


TAAATTATTA AGAGAAGCTC TTAAAGTCAA 


720 


GCAGGTTCGA 


TCAAAAATTT 


TATTTACAAT 


TTTTATCGTT 


TTGGTCTTTC GTATCGGAAC 


780 


TAGCATTACA 


GTTCCTGGTG 


TGAATGCCAA 


TAG 




813 



(2) INFORMATION FOR SEQ ID NO: 206: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 724 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:206: 



CTGTCGTTCC ATGCGTGCCA ACTGTTTCTA GCCCCATTTC CTCAAGGACA CGTGCCACAG 60 

AGTCTCCAAC CGCAGGGGTT GGTGCCAAAC TCAAGTCCAC AATACCAAAC TCAACACCCA 120 

GTCTCTCAAT GGGGCATTTG GACCAAACCA ATTGGACCGG ATACGGAGTG ATTTTAAAAG 180 

GCAGTTTTCT CAACTGGTTT TCGGCTTACT AACATCAAAG CTCTGTCCAC GAACTTTTTC 240 

CAAAGCACGT TTCACCACAC CAGGACCAGA AACTCCGACA TTGATGATAA CATCTGCTTC 300 

CCCAACACCA TGAAAGGCAC CCGCCATAAA TGGATTGTCC TCAACAGCAT TAGCGAATAC 360 

AACCAACTTG GCCGCTCCCA TATCTGAAAG ATTTGCCGTT TCCTTGTATA ATTCGTCCCA 420 

TATCTGCCAC AGCCGTCATA TTAATACCAG ACTTGGTTGA GCCGATATTG ACTGACGAGC 480 

AGACCTTATC CGTCTCAGTC AAAGCGCGAG GAATGGAATT GATGAGAATC TCATCTCCCT 540 

TTTGATAACC TTTTTGTACT AAGGCAGAAA AACCACCAAT AAAGTCCACA CCAATCTCTT 600 

TCGCAGCCTT ATCAAGCGCT TTTGCCAGAA CCAGTAGTCC GTCGCATCTG TCGCTGCCCC 660 

AATCAGAGAA ATAGGTGTCA CCGATACACG CTTATTAACG ATAGGAATTC CCAACTCAGC 720 
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CGCA 

724 

(2) INFORMATION FOR SEQ ID NO: 207: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 711 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 207: 

CTCACGAAAA GCTTTTGTAG AAGTATCTGT ACGACGCAAG ATTGACAATT TGTGTTGAAT 
CAGTGGGTGA TTAATAACTN CAATTTNCCC ATTTGTGGAA TTCCTTCTTT CAATTTATTC 
TTCTTATTAT ACCAATATTA ACGGTTTAAA AATCTCCTAA ACCATNTATN TNNGATAATT 
TTTACATTAG ATCAGCCTCT TTAAGAGCTG TCTGTACTGT CTCAAGTGGT AAATGGGTCA 
ATTCTGTCCC TTTTTCTIGA TAAAGGTATT GGGCGTAGTC GTCCATTCGG TACTGGTTGA 
TATAAACCAC ACGCTTGCAG CCGACCTGAA GCAATTGTTT TGTACAGTTG AGACAAGGAA 
AATGGGTTAC ATAGGCTGTA AAGCCTTTGG GAACACCACG CTCAGCACCT TGAAGGATAG 
CATTGACCTC AGCGTGAAGG GTGCGAACAC AGTGGCCTTC AATGACCAAA CATTCGTGAT 
CAATACAATG CTCAGTCCCT GACACCGAAC CATTGTAACC AGTGGAAATA ACCTTATTAT 
CTTTTACCAG AATCGCGCCC ACTTTAGCAC GTTTACAAGT GGAACGATTC GCAATTAGTA 
GGGCTTGGGC TGCAAAATAC TCATCCCAGG CCAGTCTTTT TTCAGTCATC TCTTTTCTCC 
TTTTTCTCTA TTTTTTAAAA AATGGTAAAC CTAAATCTGC AATCTTTTCA G 

(2) INFORMATION FOR SEQ ID NO: 208: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 677 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
711 



(ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO; 208: 

CTGTGGGACC AGNTTCAGCT GGATATTTCA AAAACTAACT ACTTACGGGT GATTAATGCT 
TTTACTCAAA TTGAAGCGGC TAAGGCTTAT TTATTTGCAA ACTCTGAGTT TTCGGGTGCG 
GATTGGGATA CGAAAATTTC AAGGGATATT TTCTGGGAAG AATCTATGCA TGGTATCTAT 
CCAGAGAATG TTGGGGTCAA TGCTAGACTC CTTAATGATG AAGCTGATTT TTTTGACTAT 

232 
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CTAAATCATT CTGCGATTTT TACTGCGGAA CGTGATGGGC AGACCTATTA TTTTTATCCT 
ATTCAGGCTG GGGACTATTT GGCTACGCCT GAAATCCAAG CATTTGCTCT GAATGGGGAT 
GAGGTTATTA TTTACCCCCA AGAGAAGGAT TTTGAAACTC ATCGTAGTTA CCAGTACCAA 
GACTTAACGA CTCGAGGAAC AGTTGAGTTT CGTAGTGTGT GTACACAGCC ACTTGATAGG 
ACTTTTGCTT CTGCAGCTTT TCACTTGGGA TTATTGGTTA ATTTAGACAA GTTAGAAGCT 
TACTTAGAAA CAGCACCTTT TCTTTAAAGT ATTTGGTTAT GATTACAAGT TTTTAAGGAG 
ACAATTTTCT AAGAAAAATC TTACAGATGA GGAAGAAACT ACGATTATTG AATTTTCCAA 
AGACTTACTC CTACTAG 

(2) INFORMATION FOR SEQ ID NO: 209: 



300 
360 
420 
480 
540 
600 
660 
677 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1160 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

Cii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO.-209: 

GTATTCGGAC GTTCGTTCGA ACCGGCCTGG TTGTTTCGCC GGCTGCACAA TGGCATTTTG 
GAAAATGAAT AAAAAGAATT TTATTCATCT TCNAAACGCT TAAGATTCTC ACACACCACA 
GATGTGTTTT AAAATCATAA TATTCAATCC CTCTATTCCA GCTCGTCAAC GATAAACATG 
TGAGCCATAT TCATNGCTTT TAGCAACGCC TTATATGGAT TGGTATCAGA TTCATAATCC 
TTCATTACAT TTACATCTCC TAGCTTCACA AGTATTTATA TATCATACAT CCTCATCTTC 
ATCCTTCATA TAATCTTTGG CAGCCCACTC CCACACAGGA AGTATCACCT CAAATACATC 
TCTATCAAAC ATCGGGGAAT GGAATTCCTT GGGAGGCTCT CTATCTGTAG AGTAAAGCCC 
AACTCCTTTA TCCTTCTTAA CTTGGGGAAC ATCTGTTGTG CCAAACATCA TATGGCTAGA 
TTCGCGAGTT GGGGCTCCAA GAAATATAGC ATTGACAAGC TGTTCTCTAA TATTGGTAGA 
AAGACT ATCG GAACGGGGTT GTTGCATAAC AATACAAACC CCCATAGATA GAGACCTTGC 
ACGATAGCAA CCATA ™ AAAATCTTTT TTAGCTTTAC TATCCATCGA 
AGCAACTATG AACTATATTC TTCTATTACC AAAAGTAAAG GAGGAAGGCC GAGGTCTATT 
CCAATACTAT CAAATAAATC ACTTTTGTTA TAAATTTCCA TCCGCCTATT CATCTCGTTA 
ATGTAGGGAC GCACGATTTC TAAAGGCATC CTCATTAGTT GTTCCATAAT TTTCTACCTG 
GAAGACCAGA AAATTTTGAT AATTTATAAA TATCTGAAAG TTTTGGATCA ATCACATATA 

TCTCCCAATO ACGGTCTCAG M *~ -~ s 

AAAATGATTT CCCAGCACCT GACGCGCCCG AGATAAGCAT TGAAAAGTTT TTTCTCAAAT 
ATCATCATA ° TCAAACTA " ATCTTCTTCC AGAGGGAGAG 

AAATTTCTTT TAAACGTrCA ""^ ATACAAAGAC ATATTCAACA TGAGTAGCAT 1140 
AAATTTCTTT ATATGATAAG 

1160 



60 
120 
180 
240 
300 
360 
420 
480 
540 
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780 
840 
900 
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1020 
1080 



(2) INFORMATION FOR SEQ ID NO: 210: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 787 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 210: 

CTGGATATGG GTTACACCCA AGTCTTTGAA ATAGTCTAGT TTTTCAATGA AGGCTTCAAA 60 

AGTCCCAAAT GGTTTGGTCA AGTCTTTTGC AATGGCAGGA TCTGAAGTGA AGTCACGCAC 120 

ATGAGCTTCG TAGATAACGG CGTCTTCACG AGTCTTGAAA TTGTGAATCT TATTTGGCCC 180 

ATAAGGCCCC TTAAAAAGGG ATCTTGAGGG CCGAGTTGAG CTGGATCTAC AAAGGCGGCT 240 

TTAGCCACTT TATGGGCATC GTCAATCTTG GCATCGTCGC TATTCCAAGC AGCAAGAGAT 300 

TTAGCGTAAG GATCGAGTGC AAGAACAGTT TTACCTTGAC GCTCGATTTG GTATTGATAA 360 

TAGTAGCCAG TGAAATCTGT GATTCCGAGT TTGTTTGTGC TGTCTAAAGT TTGTTTCCAA 420 

GTTCCTCTTT CCCCTTTTTC GAGAGCGACA GTTCCAACTA CTTTGTCAGG GTCATTCTTG 480 

TCGTAGACAA CAACAGAAAC CTTATCAGCA CTTGGTGACC AAAGGGTCAA ATCAACTTGT 540 

TTTCCTTCTT CTTTTAGGTC AAATCCCAGT TTGCCATCAT AGATCGAATG CCTCATCTTT 600 

CAGGCGCCAG GGGGATCCAC TAGTTCTAGA GCGGCCGCCA CCGCGGTGGG AACTCCAGCT 660 

TTTGTTCCCT TTANTGAAGG TTAATTGCCC CTTGGGGTAA TCCATGGTCA TANCTGTTTC 720 

CTGTGTNAAA TTGGTANCGC NCAAAATCCN NNNAAATANA ACCGGAACCT AAATTTAAAC 780 

CCNGGNN 7g7 

(2) INFORMATION FOR SEQ ID NO: 2 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 702 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 211: 



60 



CTTGTTTTTA TTGGTAACTA TACTTTAATG CGATAATTGA TTGGACTTTT GTCAAGGAGA 

TGTGTATGTT TGAAGTAGAA GAATGGCTTC ATAGTCGGAT TGGTTTGAAT TTTCGATCAG 120 

GTTTGGCCCG AATACAGCAA GCGGTGGATT TGTTAGGAAA TCCTGAGCAG TCTTACCCTA 180 

TTATCCACGT AACAGGGACT AATGGGAAAG GATCTACCAT TGCTTTTATG AGGGAATTAT 240 

TTATGGGGCA TGGCAAAAAA GTTGCGACCT TTACCTCCCC TCATATCGTA TCCATCAATG 300 
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ACCGAATTTG CATTAATGGG CAACCTATAG CAGACGCAGA CTTTATCCGT TTGGCTGATC 
AGGTCAAGGA GATGGAGAAA ACGCTTCTGC AAACTCCTGA CCAGTTGTCC TTTTTTGAAT 
TGCTGACCTT GGTTGCTTTT CTTTATTTTA GGGAGCAGGA GGTGGATTTG GTTTTATTAG 
AAGTGGGAAT TGGTGGCTTA CTTGACACGA CCAATGTGGT AACTGGAGAG CTTGCTGTCA 
TCACCTCCAT TGGGCTTGAC CATCAAAAAA CCTTGGGTGA TAGTTCCTAG AACCAATGGC 
AAACCAGAAA GCTGGTTATT TTCAAGGCTG GTAAGAAGGC AGTGATTGCG AAATTGCCTC 
CAGAAGCTAG GCTTGTCTGT CAGAAAAAAG CCGAATCTTT AG 

(2) INFORMATION FOR SEQ ID NO: 212: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 622 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii> MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:212: 

CCTGATAACT ATACCATGTG CGTTTTTTCT CTTTCCCAAA GCGGCGAAGC TCCATGGCAG 
TCGCAATGGT ATCAATGCGT TTCTAGCGAG CTAAAAATCA AGGGCGTAAT AATGAGCAGA 
TTGCCTTTGA TTCGTTGCAT AAGAGAAGCT TTCTTGGATA ATTCCATCCC ACGCGCCTCC 
TGAGACATCT TGATAGTAAA GAATTCTTCC TGCAAATCTG GAATATAGCG CAAGGTCAGG 
CTGACAGAAT AAGCAATCTT ATAGGGCACA CCAATTTGAT TTAAACTGGA AGCAAACTGA 
CTAGGATGGG TTGTCATCAA AAAGATAATA GCCAGAGGAA TGGTGCAAAG ATACTTAATG 
ACCAAATTTA GCAGATAAAA GAGCTCCTGG CTGGTTAGAG TGTAGACACC GATTCCCTGC 
CAAATCACAC TTCTCTCTCC ATAAAGTCCA ACCCCATACT CGGGAGAAAA GAGATAGACC 
ATCAAAACGT TTAAAACGGC AAATACCGTC GCAAAAACGG CTACAAAGGA AACATCTTTA 
AAGCGAATTT CTGATAAATA GAGGAGAAAG ACTGAAAAGA TGGCAATCAG CAAGAGCATT 
CTGGTATCAT AGCTAATCAT GG 

(2) INFORMATION FOR SEQ ID NO: 213: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 886 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



360 
420 
480 
540 
600 
660 
702 



60 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 213: 
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CTTATATGGA GGGCTTATAA TGGCAGATCT AAAATTAATC GCATTGGACT TGGACGGTAC 
CTTGCTGACT ACTGATAAAA GGCTGACGGA TCGCACCAAG GCAACTTTGA AAGCTGCGCG 
TGATCGTGGT ATCAAGGTCG TATTGACAAC TGGTCGTCCC TTAAAAGCCA TGGACTTCTT 
TCTCCATGAG TTAGGGACTG ACGGTCAGGA AGATGGAGTA TACCATTACT TCTAACGGTG 
GTTTGGTTCA GAAAAATACA GGTGAAATCC TTGATAAAAC AGTCTTTTCA TATGCTGATG 
TGGCACCCTT GTATGAGAAA ACAGACAAAT TATCACCTGC CACTTGATGC CAATCTCAGA 
AAGAACAGTT TATCAAATCC AATCGGACCA AGAAAGTCTT TATGCCAAAT TCAATCCAGC 
TTTGACTTTT GTTCCAGTTG ACTTTGAAGA CCTGTCTAGT CAAATGACCT ACAACAAATG 
CGTGACTGCC TTTGCTCAAG AACCCTTGGA TGCAGCCATT CAGAAGATTT CTCCAGAATT 
GTTTGACCAA TATGAAATCT TTAAATCACG TGAAATGTTG CTAGAGTGGT CACCAAAGAA 
TGTTCATAAA GCAACAGGTT TGGCAAAACT AATCAGCCAT CTTGGAATCA ACCAAAGCCA 
AGTTATGGCC TGTGGTGACG AGGCCAATGA CCTTTCTATG ATTGAATGGG CAGGTCTTGG 
TGTCGCTATG CAAAACGCTG TTCCTGAAGT AAAGGCAGCC GCAAATGTAG TAACGCCGAT 
GACCAACGAT GAGGAAGCTG TCGCCCGGGC TATCGAACAA TATGTGCTAA AGGAGAACTA 
AGATATGGGA TTGTTTGACC GTCTATTCGG AAAAAAAGAA GAACCT 

(2) INFORMATION FOR SEQ ID NO: 214: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 671 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:214: 

CTATGAGACC TAACACAATT ATTCTACCAA AAATTCAATT AAAAGTCAAT TCTCTATTTA 
TGGTAGGGGA ATCCCTGCTG AATCGTAAAA GCGCGATAGA TTTGTTCAAC AAGAACTAGT 
CTCATTAACT GATGAGGCAA GGTTAGGCGA CCAAAACTGA CAGAAAGATT GGCTCTATTT 
TTTACAGATG ATGACAATCC TAAACTTCCC CCAATAATAA AAGTAAGAGT AGAAATCCCT 
CCTTATAGAA GTTTCTTCTA ACTGCTTACT AAATTCTTCT GAGAAGAAAG TTTTCCCTTC 
AATGGCTAAC ACAATAACGA AATCACGGTC AGCAATTTTT GATAAAATTC TCTGACCTTC 
TATTTCTAAA ATCTTTltSAT TTTCTGATTC ACTGGCCTTA TCTGGTGTTT TTTCATCTGA 
TAACTCAATC ATTTCAAACT TAGCAAATCT AGAAATTCGT TTTGAATACT CTGCGATACC 
ATCTTTTAAA TACTTTTCTT TCAGTTTCCC AACTGTTACA ACTTTAATTT TCATGACTCT 
ATTCTAACAT ATTCTCTATT TTTTCACATC TTATTCACAA AATAAAAAAT AGATTTCAAT 
TAAGAAAATC ACAGTTTCAA AAGAGTTATC CACAGTTTGT GTAAAACTTT TGTGTTTAAG 
TTATAATTAA G 



60 
120 
180 
240 
300 
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420 
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840 
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(2) INFORMATION FOR SEQ 



ID NO:215: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 521 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DMA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 215: 

CTGGGCTGAG TTTTCCAGAT ACAAATACTG CAACCGCAAC TGCAATCCCC CAACCCATAG 60 

TAATCACAAT CCAACCTGCA CTGTTGCTCT TGGTTTTTGG AAGAACCACA CCTGCAACAA 120 

CACCATTTCC TAGAAGAATC AGGATTAAAG TCCCTAAAAA TTCTCCAAAT AATTCACTCA 180 

TCATT TTTCT GTCTCCATTA AAAAGAAGGG GCGGGCGACA AGGATTGCTA CCCTCCACCT 240 

CTTTTATTTT TTCTTAATTT TTTAATTCTG CTAAGTCGTT TTGAGCAAGA GCTGCTTTTA 300 

CATCAGCACG GTAAGTTGCT TTTTCTTCTT CTGTCCAGTC ATAGAATCGT CCCATTTCAT 360 

CCAAAACTGG CTCAACGATA CTATCCAAGC TATCACGCAT AAAGAGCATG TGATTGGTAC 420 

GACGAAGAAG GAAGTCAACT GGGCTAAGAG TCAACTCATT GCGCATTGCA TAGTGAAGGG 480 

ACAAGGTGTC TGCCAAGCTA AGACCTGGCG CTTGTTCCAA G 521 

(2) INFORMATION FOR SEQ ID NO: 216: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 744 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 216: 

CTGGAATAAC ATTATAACGA CCAGTCATGT ATAATGGAGT TTGTTCTCTA TCAGTATTTA 60 

CGTTTACAAA GAACTTACCA TTTCCTTTAT TTGAATAGTC TCCATGTATG AATACTTTTG 120 

TATCCTCAAG TACTTTTCTT TGGATGGTCC T3GATGCGCT ATTACATATC TAAAAAATTC 180 

TTTTATTACT TCGCTAAAGA GTAAGCTTTC TTTATTATTT ACACCATAGG TATAAGCTGG 240 

GTGTTCAAAA TTATAAACTA CACCTCCATT AAGGTAAACC CCTAAAGCTT CTTCTCCTAG 300 

CATAGACTCT GGTTCAGTCA ACCATTGTCT ATCTCCTTGA CTTTTTCCGA TATTACCACT 360 

TGCAAACAGT TTCCATTTAC CTGTCTCATA CCACTTCCAA GTATCCATCA AACCACCCCA 420 

TTGATAAGTA TGATTTGAAA GCCAAAGTCC CTTCATATAA CTTTCTGTAG TTGAATCATT 480 

TCCCTCAGCA GCAGGTGTAT TTTTGAACAT GAAAATAAGG TTCTTCCAGT ACTTATCAAC 540 

AGATTTCTGA AATGCTATCT TACCGTTTTT TCCAAATGCT TTTTCTATTG CAGAACCATT 600 
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ATTTTGTTCA GCCGGGGCAT ATAAAATATC CTCCATTTTT AGCTGATACT TTTAAGTAAT 660 
CTGCCGCCTT ATTTTCAATA TCATTTGCCC AAATCCAATA ATTCTCTGTA CTGAAAATAC 720 
CATGCAAATT TGGATATTTT TGAT 

744 

(2) INFORMATION FOR SEQ ID NO: 217: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 518 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:217: 

CTGAAATTTT GGAAGAAAAA ATCGGTGTAG ATACAGTCCA AAAAATAGGA CGCATCTTGA 60 

TTTTGTTTAA ACAATCTAGC AAGAAAGAAA ATCGCAAGAT TTCTAAGAAA GTCAAAGAAA 120 

TCTAAGATCG AAACTCCAAA TAACTGTTTT TATAGAGAAA TAAAGGGGAC TAGCCTATGA 180 

CAATCGAACT ATTGACTCCC TTTACCAAGG TAGAGTTGGA GCCAGAAATC AAGGAGAAAA 240 

AACGCAAACA AGTTGGGATT TTAGGGGGGA ATTTTAACCC TGTTCACAAT GCCCATCTCA 300 

TTGTTGCGGA TCAAGTACGG CAACAGTTGG GACTGGATCA AGTTCTGCTC ATGCCTGAAT 360 

ACCAACCTCC TCACGTTGAT AAAAAGGAAA CCATCCCTGA ACACCATCGT CTCAAGATGC 420 

TTGAGTTTGG CAATTGAAGG GAATTGAAGG CCTAGTCATT GAAACCATTG AGTTGGGAGC 480 

GCAAGGGTAT TTCCTACACC TACGAAACCC ATGAAAGA 5I8 

(2) INFORMATION FOR SEQ ID NO: 2 18: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 886 base pairs 
<B) TYPE: nucleic acid 
(C) STRANDEDNESS: single 
<D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 218: 

CTTGCTTNGN GGCTCGCTTA CTTNGACCCA CGTCGCTATG ACTTGGCAGC AGTTGGTCGT 
TACAAAATCA ATAGAAAAAC TCAATGTTAA AACACGTTGT GTTCAACCAA ACCATTGGCA 
GGAGCCATTG GTAGACCCTG AAACTGGAGA AATCTTGGTA GAAGCTGGTA CGATTATGAC 
TCGGTAGCGT GATTGAAAGC ATTGAAAGCC ATTTGGATGG CGACTTGAAC AAGATTGTCT 
AACATCCCAA ACGATGCAGC CGTTGTGACT GAGCCTGTTG TTCTTCAAAA ATTCAAGGTT 
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GTTGCTCCAA CTGATCCAGA TCGCGTCGTA ACGATCATTO GTAATGCTAA CCCAGATGAC 360 

AAGGTTCGTA CGGTGACTCC TGCAGATATC CTTGCTGAGA TGAGCTACTT CCTCAACTTG 420 

GCTGAAGGAC TTGGCCGTGT AAATGATATC GACCACCTTG GAAATCGTCG TATCCGTGCG 480 

GTTGGTGAAT TGCTTGCCAA CCAAGTACGT TTGGGACTTT CTCGTATGGA ACGTAATGTC 540 

CGTGAACGTA TGTCTGTTCA GGACAATGAA GTCTTGACAC CACAACAAAT TATCAATATC 600 

CGTCCTGTAA CAGCTGCGGT TAAAGAATTC TTTGGTTCAT CACAGTTGTC ACATTTCATG 660 

GACCAACACA ACCCGCTTTC TGAGTTGTCT CACAAACGCC GTTTGTAGCC TTAGGACCTG 720 

GTGGTTTGAC TCGTGACCGT GCCGGATATG AAGTACGTGA CGTGCACTAC ACTCACTATG 780 
GTCGTATGTG TCCAATCGAG ACACCTGAAA GGACCTAACA TCGGTTTGAT CAATAACTTG 
TCATCTTTAC GGACACTTGA ACAAATATGG TTTTGTTCAA ACACCA 



(2) INFORMATION FOR SEQ ID NO: 220: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 708 base pairs 

(B) TYPE: nucleic acid 

239 
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(2) INFORMATION FOR SEQ ID NO: 219: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 791 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 219: 

CTCTCTTGAT GGCTGAACAT GCGCCAAATG CTAAGATTAC AACTATTGAT CGTAATCCAG 60 

AAATGATTGG TTTTGCCAAG GAAAATTTTG CCCAGTTTGA CAGTCGCAAG CAAATCACTC 120 

TCCTAGAGGG AGATGCGGTG GATGTCTTAT CTACACTGAC AGAGTCTTAT GATTTCGTCT 180 

TTATGGATTC TGCCAAGTCT AAATACATCG TCTTTCTGCC AGAAATCCTC AAACATTTGG 240 

AAGTTGGTGG TGTGGTTGTC TTGGATGATA TTTTTCAAGG TGGTGATGTT GCCAAGGATA 300 

TTATGGAAGT CCGTCGTGGT CAGCGAACCA TTTATCGAGG CCTTCAAAAA TTATTTGATG 360 

CAACCTTAGA CAATCCAGAA CTCACCGCAA CATTAGTGCC TTTAGGAGAT GGTATTCTCA 420 

TGCTTCGTAA AAATGTAGCA GATGTTCAAC TGTCTGAAAG CGAATGATTT TCAGAAAAAT 480 

TTAAGAAAAA ATAGTAAAAT AGATAGAGTA ACACTTATCT CAAAGGAGTA GACATGAAGA 540 

AAAAATTATT GGCAGGTGCC ATCACACTAT TATCAGTAGC AACTTTAGCA GCTTGTTCGA 600 

AAGGGTCAGA AGGAGCAGAC CTTATCAGCA TGAAAGGGGA TGTCATCACA GAACATCAAT 660 

TTTATGAGCA AGTGAAAAAC AACCCTTCAG CCCAGCNAGT CTTGTTAAAT ATGACCATCC 720 

AAAAAGTTTT TGAAAAAACA ATATGGCTCA GAGCTTGATG ATAAAGAGGT TGATGATACT 780 
ATTGCCGAAG A 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 220: 

CCAACTCTTG AATCCGATTA TTTAAATTAT CAAAATCAAA GATTGTTGTT TCAGAGAAGA 
TTTTTGGGTC CGGTGAGAAG TGAACAGTTG TTCCTGTTTT ATCCGTATCT CCAACTATTT 
CAAGTTATGC GACAACATGA CCACGACGGT ATTCTTGGTA ATGAATCTTA CCATTTTTGT 
GAACATGAAC GTCTAATTGA GTCGAAAGGG CATTAACAAC TGACGACCCC ACCCCGTGAA 
GACCACCTGA AACCTTGTAT CCACCACCAC CGAACTTTCC TCCAGCGTGA AGGACTGTAA 
AGACGGTCTC AACAGCAGGA CGGCCTGTAA ATTCCTGAAT ATCGACTGGG ATACCACGCC 
CATCATCCAC AACAGTAATC GAATCATCTG GCTCAATAAA AACTTGAATA TGGCTGGCAA 
ATCCTGCCAA GGCCTCGTCA ATTGAGTTAT CAACAATTTC CCAGACTAGA TGGTGAAGAC 
CTTCTTTTGA AGTTGATCCA ATGTANATCC CTGAACGCAT NCGAANAGCC TCTAAGCCCT 
CTAAAACTTG AATTTGACTG GCATCATAAT CCTGTGCCTG CAGATTTCTG ATTTCTCCTG 
TCATCTAATG CCTTTTTCTT ATATGTCTAA TACTTGTCTC AAATTCACGA TACTGGTAAA 
CAAGTGGGTA TCAAGTCCTG CAGCTTGACC TGCTTCGATA TCAATCNG 

(2) INFORMATION FOR SEQ ID NO: 221: 

(i) SEQUENCE CHARACTERISTICS: . 

(A) LENGTH: 542 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 221: 

TTTAAATTTT TTNTATGNNC TCCTACTACG CGGTGACTAG CATCATAAAC TGTAGTATCC 60 
ACTAGACCCC TCTAAAGTAT CAACTTGGTA ATCCAAGAAC ATGCCATCAT CACTTAAAAC 120 
TTGTTTTTTA TCTGTATACA TTTGATGAAT TTTAAAATTA TATGATGGAT AGCCTGTGAT 180 
AGTCACAGTT ATTCCTGTCA AATTTTTTTG ACTAGTAGGA AGACCCAAAG TCCCTAATTT 240 
TGCACCAATG GGCTCTTCTA GAATTAATAA AGCCAAGTCA TATTCCCTTG CATCCTTAGA 300 
ATTTAAATTT CTAAATTCCT TCAAATAACA AACTTCCTTT ACTTTGATCT TTCCAAATAG 
TTCTTGACTT GGACTAACAG CCGGAAGAAC ATAAATATCA TCCGCTTCTT TCCCATAGTC 
ATGTCTGTAG TTAGTTGTGA GCCCGCTGGT AATTAACACA TTATCTTTAA TAAAATTCGC 
ACTTCCCCTC CCTTCTACAT TCATTCCATC GGCTACTCCA TAATACTTAG ATAAAACCAT 



AG 
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(2) INFORMATION FOR SEQ ID NO: 222: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1028 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 222: 



CTCGTTACCC AGAGTTCGCT CAACTTGAAG GGCAACTTAA AGGGNTTGNG CAATTGAAAA 
AACACGGAAT TGAAGGTGTA GTTGTTATCG GTGGTGACGG ATCTTACCAC GGCGCTATGC 
GTTTGACTGA ACATGGCTTC CCAGCTATTG GTCTTCCAGG TACAATCGAT AACGATATCG 
TTGGTACTGA CTTTACAATC GGTTTTGACA CAGCGGTTAC TACTGCCATG GACGCTATCG 
ATAAGATTCG TGATACATCA TCAAGTCACC GTCGTACTTT TGTAATCGAA GTTATCGGAC 
GTAACGCTGG TGATATCGCT CTTTGGGCTG GTATTGCAAC TGGTGCTGAT GAAATCATCA 
TCCCTGAAGC AGGCTTCAAG ATGGAAGATA TCCTAGCAAG CATCAAAGCT GGTTATGAAT 
GTGGTAAAAA ACACAATATC ATCGTCTTAG CTGAAGGTGT GATGTCAGCG GCTGAATTTG 
GTCAAAAACT TAAAGAAGCT GGAGATACAA GCGACCTTCG TGTAACAGAA CTTGGACATA 
TTCAACGTGG TGGTTCTCCA ACTGCGCGTG ACCGTGTTTT GGCGTCACGT ATGGGTGCAC 
ATGCTGTTAA ACTTCTTAAA GAAGTATCGG TGGTGTTGCG GTTGGTATTC GTAACGAAAA 
AATGGTTGAA AATCCAATTC TTGGTACTGC AGAAGAAGGG GCATTGTTTA GCCTTACTGC 
AGAAGGTAAG ATTGTGGTTA ACAACCCGCA CAAAGCCGAT ATTGAGCCAA GTCTTCTGGA 
CCATTTTTTT GTAGATACTC ATCAGGATCC AAGTTATCAG GCATGCTAAC GATTTGCACA 
GGCATATCAC CAATTTCATC CAATGCTTTC AATATCGCGG CTTGCCCAGC CTTATCTCCA 
TCGTAAACAA GAACCAATTT CTTGGTTAAC CTTTTCAGAT GCTCAACATG CTCTCGACTC 
AAGGCCGTTC CCATCGACCC ACAGCATTTT CGATTCCACC CGATAGGCTG CAATAACATC 



(2) INFORMATION FOR SEQ ID NO:223: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 555 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 223: 
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CTGAAGAAAA TCAGTAAACA AGAGCTGGTT AACACGCGCT TTTCTCGTTT ATTTGCTCAT 
TTTGGACAGG AAAAAGACGG TAGTTTTCTT GCCCAGCGTT ACCAATTTTA CCTCGCCCAG 
CAGGGACAAA CACTATCGGG CGCTCATGAT CTCTTGGACA GCCTCATTGA GCGTGATTAT 
AACTTGTATG CTGCGACAAA TGGCATTACT GCCATTCAGA CAGGACGTTT GGCTCAATCT 
GGTCTAGCAC CTTATTTCAA TCAAGTCTTT ATCTCAGAAC AGTTGCAAAC TCAAAAGCCG 300 
GATGCTCTTT TTTATGAAAA GATTGGCCAG CAAATTGCTG GATTTAGTAA AGAAAAGACG 360 
CTGATGATTG GAGATTCTCT AACCGCCGAC ATTCAAGGTG GCAATAATGC GGGGATTGAC 
ACTATCTGGT ATAATCCTCA TCACCTCGAA AATCACACAC AAGCCCAGCC GACTTACGAA 
GTCTATTCTT ACCAAGACTT GCTGGATTGT TTAGATAAAA ATATTCTTGA AAAGATCACG 
TTTTAAGGAG ATTAG 



60 
120 
180 
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(2) INFORMATION FOR SEQ ID NO: 224: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 764 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 224: 

CTCATAACCG TTCAAGNCGN TATCGTAAGC AATGGCACGA ACCTTAGCCA AATCACCTTC 60 

TAATTCGTGA GCAGTCTCTT CCTGTGGAAG TGTGAATGGA TGGTGGGCGC TCATGTAGCG 120 

GCCTTCTTCT TCAGACCATT CAAAACATCG GCCAGTCAAC CACCCAAAGG AAGTTGAACT 180 

TATCATTATC AATCAAGCCA AGCTCTTTAG CAATACGTCC ACGAAGGGCA CCCAGTGTTG 240 

CATTAGCCAC TTCAAGCGTA TCCGCCACAA AGAGAACCAA GTCCTTATCT TCAAGAGCAA 300 

GCGCTGTTGT CAATTCTTCT TGGATACCAG TCAAGAACTT GGCAACTGGT CCGTTTAATT 360 

CTCCATCAAC CACCTTGACC CAAGCAAGAC CTTTGGCACC ATACTGTTTG GCTACTTCCG 420 

TCATCTTGTC GATGTCTTTA CGTGAATAGT TGTCCGCAGT CAAGGGCTCG AGACCGTTGA 480 

CCACAATCGC TTTTACAGCA AGTGCTTCTG AAAAGACTTT AAAGTCTACA CCTTTGACCA 540 

CTTCTGTCAA GTCCTGAAGC AACATGTCAA AACGAGTATC TGGCTTGTCA GAACCGTAAA 600 

GAGCCATAGC ATCATCGTAT TTCATATGAG GAACGGTAGC GTTACTTCAA TGCCTTTTGT 660 

TTCCTTCATC ACGCGCGCGA TCAAGCCTTC TGTAATATCT TGGATTTCTT GCTCAGTAAG 720 
GAGATTCCAA GTCGACTGAG TAATTCAGCT GGCGGTCTCC ACGC 



764 



(2) INFORMATION FOR SEQ ID NO: 225: 



<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 563 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

242 



WO 97/43303 



PCT/US97/07950 



60 
120 
180 



(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 225: 

CTGTATGAAC AATATAGGAA CGTCTCCGGA TAAACTCCTT GGATTAATGG AGTTTCTTTT 
GGCATGGATG AAACACCATT AAAGGAAATT CATAATATGT CCTGCCATAA AGGTTACATT 
GTTTTCTTTA CACGCATCTA CCATCTCGCG ACAATCTTGA TAAGAAAGCG GCAATTGGTT 

TTTTCAACAG AAAACATTTT TACCATGCTG TGCAGCCTTA ATAACCGGTT CCTTATGAAG 240 

ATTATTTGGG AGTTGCGACG ATAACACAAT CTACTTCATC GCTAGAAACC AACTCATCTA 300 

AGGAACTTGC TACTTTTGCT CCCAATTCTT CTGCAATCGC CTCTGCATTA TCTGGATCAT 360 

AGAGAAGAGT AATCTCTGCT CCATCATTCT TTTGCATATA GCGAGCCAAT TCAGCTCCAA 420 

AATACCCTGC TCCAACAACA CCGTATTTAA CCATGTTTTT TCTCCTTTAT TTTGTTAAAT 460 

CATTAATTTC CAAATCTATA TACTGAACTA CATTGCTTAA ATGCAATTCA TTTCTCGACC 540 

ACGAATCATA TTTTTCAAAC AGT 5g3 

(2) INFORMATION FOR SEQ ID NO: 226: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 732 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 226: 

ACTATTGAGT CACTTTGGGA AGCGAACATG GAGTATATTT CTCCAGAAAA TGCCTTGGAT 60 

AGTCGTAACC GTCAATGGAA GATTTACTCA AGAAACTTGA TTTCACCACC AAACTTCCTC 120 

GGGGCAAATG CTCATGTGGA AGACTCATTA GTTGTAGACG GATGTTTCGT TGAGGGGGGA 180 

AACTGGTAAA ACATTCTTAT CCCTTTCAAC AGGCGCGGCA AAGTTTCGCG AAAGGAGCGG 240 

GAAGTCCCTT GGATTTCAGT TATCATGGAG TGGAGCTATC ATTGGTCAAG GAGCTAAGAT 300 

TAAACGTGCC ATTATTGGTG AAGGTGCGAT TATTTCTGAC GGTGTCGAAA TTGATGGAAC 360 

AGATGAAGTA CAAGTTGTAC GATATAATGA AGTAGTGGGG GTAGCAACAG ATGAAGATTG 420 

ATAAATATTC TGCCATTTTA GGAAATACAG TTGGTTTTCA CAATATGTCG ACATTGACGG 480 

ACCACCGTCC AGTAGCAAGT TTGCCATTTG GTGGGAAATA TCGTTTGATT GACTTCCCGC 540 

TTTCAAGCCT TGCTAATGCA GGTGTTCGTA GTGTCTTTGG TATTTTCCAG CAGGATAATA 600 

TCAGCTCAGT ATTTGACCAT ATTCGTTCAG GACGCGAGTG GGGCTTATCA ACCCTTCTTA 660 

GTCATTACTA CCTAGGAATT TACAATACCC GTGTAGAAAG TTAGTACAGT TGGAAAAGAA 720 
TACTACCAAC AG 



732 
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(2) INFORMATION FOR SEQ ID NO: 227: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 508 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY; linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 227: 
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CTCGTGTGAT TTTTTTGCCT ATAAATAATT TTTATCAGCC TGATATATAA TATATATTAT 
GCAAGTGATT GGAAGTGTGA TAAACTAATA TAACAACGAA AATCTTATGA GAGAATAAAG 
CTTTATCTTA GATAAACTTA GGAATCAGGA TTTTATAAGA TTTTCCAATA ATATTAGTGT iOU 
CAATAATAAG AAAAGAGGTA TCTTATGACA ACATTTACTA TCCATACAGT AGAGTCAGCA 240 
CCAGCAGAAG TGAAAGAAAT TCTTGAAACA GTAGAAAAAG ACAACAATGG CTATATTCCC 300 
AACCTAATCG GTCTCTTGGC CAATGCCCCG ACTGTTTTAG AAGCCTACCA AATTGTCTCA 360 
TCTATCCACC GTCGCAACAG CCTGACACCC GTTGAGCGTG AAGTGGTGCA AATCACGGCA 



420 
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GCCGTGACCA ATGGTTGTGC CTTTCTGTGT CGCAGGTCAC AACACCCTTT TCCATCAAAC 480 
AAATCCAGAT GAATGATGAC TTGATTCA 

(2) INFORMATION FOR SEQ ID NO: 228: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 529 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 228: 



CTGAACCCGT ATTAATAAGG GCATCCAACC TGGAAATGAT CAGCCAAAAT TCGCCACACA 60 

ATTCATAGCA GACATGACCT TACCAATTCC ACTTTCTACA AGAACGACTT CATGAGAAGC 120 

AATGGTTCCT GTATGATAGG TATTCCCCAA AACAACTTGC TCCTGGGCAT TATCTAAATG 180 

CTGGACCAGA TAAGCCAGTT CTTCTGGCAT AGCAGCAATA ATTCCTATTT TCATTTCAAT 240 

TCCTTTTCTA TTACAAAAGT TTCATTGCTA AAACAAGCAA AATCAAGAGA AAGATGACCG 300 

CAAATAAGAT TTTATTCAAC TTAGAATTGA AGACATTTCT CTTGGTATTT TCAATACGAC 360 

GGCTTTTATG AATAGATGGT TCGACCTGAA TCTTCAAGGT TTCCTGGCTA AAACCATGAT 420 

CCTTAGGATT GGCATAACCA AAACGGGAAG AAGAGGTTGG TAAAATCTTA GTTTCCTCAT 480 

TATCATCTAG CAAAGGAGAG CACCGTTGAA ATTTTTTCGC CTCTATTAG 52 9 
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(2) INFORKAT I ON FOR SEQ ID NO: 229: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 730 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 229: 

AGCTTATTTT GATAGAATGA TTAAAACAAA ATCTAGAAAG GGAATCTATG TTACACAACG 
CATTTGCCTA TGTTACAAGG AAGTTTTTCA AATCGATTGT CATCTTCCTG ATTATTCTCC 
TTATGGCGAG CTTGAGTTTG GTCGGCTTGT CAATCAAGGG AGCTACTGCC AAGGCTTCTC 

AGGAGACCTT TAAAAATATC ACCAATAGCT TCTCCATGCA AATCAATCGT CGCGTCAACC 240 

AAGGAACGCC TCGTGGTGCT GGGAATATCA AGGGTGAAGA CATCAAAAAA ATCACCGAAA 300 

ACAAGGCCAT TGAGTCTTAT GTCAAACGTA TCAACGCTAT CGGAGATTTG ACTGGATATG 360 

ACCTGATTGA AACGCCAGAA ACCAAGAAGA ATCTCACTGC TGATCGTGCC AAACGTTTTG 420 

GAAGTAGCTT GACGATTACA GGTGTCAATG ACTCCTCTAA AGAAGACAAG TTTGTCTCTG 480 

GTCCTATATA AACTAGTCCA AGGAGAGCAC TTAACCAACG ACGACAAGGA TAAAATCTTC 540 

TCTTCGCACA AGGACTTGGC AGCCAAACAC GGCTGGAAAG TAGGGGACAA GGTTAAACTG 600 

GACTCTAATA TCTACGATGC AGATAATGAA AAAGGAGCCA AGGAAACAGT TGAAGTGACA 660 

ATCAAGGGAC TCTTTGATGG TCATAATAAG TCAGCAGTAA CCTACTCACA AGAACTTTAC 720 
GAAAACACAG 

730 

(2) INFORMATION FOR SEQ ID NO: 230: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 535 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:230: 

CTGACATGGT TTAAAGAGAT TTTCAAAGAG TATCGGCCAT GGCCCCTCCC TGCAAACTAT 60 

ACATCTTGTG ATAAGTTCCT CCCAGAGCCA AGAGTTCCTC ATGGGTTCCA CTCTCGATAA 120 

TGCGTCCCTT ATCCAAGACA TAGATGCAGT TGGCGTCTTG AATAGTAGAA AGGCGGTGAG 180 

CGATAGCAAT AGTTGTTCGG CCCTGTCTCA TCTTCGCCAG AGAAGCTTGA ACCAAACTTT 240 

245 
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CTGTTTCAGA GTCAATATTG GCTGTCGCTT CATCCAAAAT CAGGATTTTA GGCTGGCTGG 
CGACTGTTCT AGCAAAGGCA AGAAGCTGGC GCTCCCCAGT AGAGAAGCTC GAACCACGCT 
CGGAAACAGG GGAGTCGTAC CTCTGAGGAA GTTCTTGAAT AAAGGAATCT GCATCCACAA 
AGGCTGCCGC AGCCTTGAAC CTGCTCATCA CTGATTTCTT GGTACATGGC GAAATTGGAC 
TTAATAGTTC CCTTATAGAG AGGTCCTGCA GAACCAAACC GATGTTTTTT CTCAG 

(2) INFORMATION FOR SEQ ID NO: 231: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 791 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 231: 

CGGTTAAATC GAAGGAACAA TCGCCATCAG GATAACAGCT ACCCACCCTA GCGTATCTTC 

CAGAAAATGC AGACTCAGAA TAGACTCATT CTTTGTCTTT CCCTTACCAA CCACCAGACT 120 

CGCTAACAGA TTGATAGTAA TCGCAATAAT TCCTAACCAG AGAATCCCCT CATCATTGAC 180 
TGGTTGCGGA TGCAAAATCT TCGTGACATT TTCCAAAATG ACTAGAACAG AGCCTGTTAC 
GAGAATCACA GCTGTTACCA AGGCTCCTAG CAGGCTAAAT CGCTTATAGC CCAAGGTGTA 

CTGATTGTCT TCTTCACGAT TGGAGATTGT TTCTAGAAAA GCTCATATTC CAATTGCAAT „ u 

CGCATCTCCC AAGTCATGCA CAGAGTCAGC AAGAACAGCG CTAGAACCAA ATACTCCACC 420 

TGCAATAAAC TCAACAATGG CATAAGTCAA ATTTAAGAAA AAAGCCACCC AAACAGCATA 480 

TTTTGCCTTC ATATTTCTCA TTCCTTTGTT ATAATAGATT TATGAACACC TTGTTCATTA 540 

TCATTATCCA CTAAAACTCA AAGAAAGGAT AGAAGCAAAA CGTCAGCTTT ATTCAGTTCT 600 

GAACAATTTG CCTTAAGTGT CCATATGACT AACATTGACC GCCGTATCAG CAAAACCAAA 660 

AAAGCCATCT ATCAAGCTTT TATACAACTT TTGAATGCTA AGGGCTACGA GGCCACTACT 720 
GTTCAGGATA TCATTGATCT CGCAGATGTG GGACGATCCA CCTTTTACTG TCACNCTATG 
AGAGTAAGGA G 



60 
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(2) INFORMATION FOR SEQ ID NO: 232 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 522 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 232: 



CTCAGGCGAT TACTAATTAC. ATGACTTCTG CCTCAAACTT TAATGTCGAT GAGGCTAGCC 
AATTCATTCA ACAATTTACA ATTACAAAAC AAATCGAACA AGTAGAAAAA CTATTAGAGG 
AGTAGCATGG AAACTGCATT AATTAGTGTG ATTGTGCCAG TCTATAATGT GGCGCAGTAC 
CTAGAAAAAT CGATAGCTTC CATTCAGAAG CAGACCTATC AAAATCTGGA AATTATTCTT 
GTTGATGATG GTGCAACAGA TGAAAGTGGT CGCTTGTGTG ATTCAATCGC TGAACAAGAT 
GACAGGGTGT CAGTGCTTCA TAAAAAGAAC GAAGGATTGT CGCAAGCACG AAATGATGGG 
ATGAAGCAGG CTCACGGGGA TTATCTGATT TTTATTGACT CCAAATGATT ATATCCATCC 
CAAGAAATGA TCCAGACCTT ATATAACCAA TTAATTCCAA GAAGAATGCC GGATGTTCCA 
AGCTGTGGTG TTCATGAATG TCTCTGCTAA TGATAAAACC CC 

(2) INFORMATION FOR SEQ ID NO:233: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 589 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 233: 

TCGGAACAGG GGATAATTTT CATTAATAAA ACCCAAAATT TTTTCATGGA GGTAAACTTT 60 

TCATAACCCT TTTTCAAGGA AAAAATGGCA ACCTACTAGC TAATAAACAA GGAGCTTTTA 120 

GTGGAAAATT CGGTATAAGG TAAAACTATA CCCTAACCAA TTGAAATAGC TATTAGCGAC 180 

TTTCTCTGAA ATATGGTATG ATAAAGGATA TACAAGGAGA TAAAATGAAT AATAATTTAC 240 

TGGTATTACA ATCAGACTTT GGTCTGGTTG ATGGTGCGGT ATCGGCTATG ATTGGAGTGG 300 

CTTTAGAAGA GTCTCCAACC TTAAAAATCC ATCACTTGAC GCACGATATC ACGCCTTATA 360 

ATATTTTTGA GGGGAGCTAT CGTCTCTTTC AGACGGTGGA TTACTGGCCT GAGGGAACGA 420 

CGTTTGTATC GGTTGTCGAT CCAGGTGTCG GTTCGAAACG TAAGAGTGTA GTTGCCAAGA 480 

CTGCACAAAA ATCAATACAT TGTCACGCCA GATAATGGGA CGCTTTCCTT TATCAAGAAA 540 

CACGTTGGCA TTGTAGCCAT TCGTGAGATT TCTGAGGTGG CCAACTAGG 589 

(2) INFORMATION FOR SZQ ID NO: 234: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 730 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: Genomic DNA 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:234: 

CTGAGTGAAG AAAAGTACCA CCACGAGAAA TGATGTCTCC TACTGAAGCT GCATCTAGGG 

GATGAATTTC ACCGGCAACC ATACCAGCAT ATCCGTCATA GATACCAAAC ACTTCCATTC 120 

CTTCTGAAAT TGCTTGACGA ACAACTGCAC GGATAGCAGC GTTCATACCA GGTGCGTCTC 180 

GGCCACTAGT CAAAACAGCA ATACGTTTCA TATTGGTTTA TGCTCCTTTT TCTTTTAACA 240 

TTCTTTCTTG ATTATATCAC ATTTGATTTT AAAATTCTTC TATTTTCCGT ATTTTTAGCG 300 

ATAAATCGTT TTCATAACGA TTTCATTCAA TTTCTCCTCT AATTCATTGG ATTTAGCTAC 360 

AAAATGATGG GGAGAAACGA TGGTTTTCTG TTCCTCTTCA TACCGGATGA TGACTGGGAT 420 

TGGGCCTTTA AATTGTTCTA AAATACGTGA AATTTCTTGA TCCGATTCAT GATTTTTCAC 480 

CTGTATCCAA AAGCGTTCAG CAACTGCTTC TCTTATTTCT TGTGCAATCA TTTGCAAACG 540 

GCCATCACGT GATTGTATTT TTCCTTTTAC ATAGTAGAAG GCTCCCTCTT TTATTTCCTG 600 

TCCAACCTGA CGATATAAGT CTGAAAAGAG AGTGACATCC AATTTTTTCT TACTATCATC 660 

TGCCTGTAAG AAGGCCATAT TTTCACCCTT TTTGGTACGA ATCACTTTTA TTTTCTGAAC 720 

TTCAACCAAT ?3Q 



(2) INFORMATION FOR SEQ ID NO: 235: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 823 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23 5: 



TGTGATATCT 


TNTGCTTNGA 


AGGNACCTAA 


CGTGGCGGCT 


GTCGCTCGAA 


ACCAAGCGGT 


60 


CATCTCCGAG 


GAGAAGGTAT 


TCTCCTTCTG 


GGAACAGTAA 


AGCTAAAGTT 


GGTGTTGTAG 


120 


TTGACATCAA CNGTGAAGGC 


TTGGGCTTTT 


TGAGCGATAC 


TTCTAAAGAA 


AGTTCCTTTA 


180 


TTTCCTTCAA 


AGCCCTTGCC 


TGAGTAAGTG 


CTTTGGAGTT 


TGTCATCCTT 


GAAGCGTTTG 


240 


ATATAGTCTG. 


CTAGATAAGG 


CTCGTCCGTT 


TCTTTGTCAT 


TGGATGTAGG 


AGTTTATTCA 


300 


TTTTTCGTAA 


CGAATGGTGT 


CGCCAGGCAT 


TCCAATCACG 


CGCTTGACGA 


TGTCCTTATT 


360 


GCCATCTTCC 


TCATGGGCCA 


CCACGATATC 


AAAACGGTCA 


ATAGGAAGGT 


GTTTTACAAC 


420 


GAAGAGAATT 


TCGCCATCCG 


CTAGGGTCGG 


ATCCATGGAA 


TGTCCTTCTA 


CGCGAACATT 


480 


GCTCCAAAAA AAGATACGAC 


TTAAAGCTAG 


TAATGACAGA 


ATTAGGAGGA 


ACAGTCCCCA 


540 


CTCTTTTAAG 


AAATTTTTAA 


ATGAATTCAT 


AACTTACCTT 


TCTAAGCGTT 


TTTTCGCTTT 


600 


TTCAGTGTTT 


TTAAAGTGCA 


ATTTGGCGCA 


GAAGTTGAGT 


CCCTGCATAC 


CATAGGCTTG 


660 


CAAAATCTGG 


CTAGCCACCT 


TGTCAGAAGC 


CGTTCCAGCT 


CCACTTGGAA GCTGATAACC 


720 


CAGTTCTCGT 


CCCAGATTTT 


CAAGATTTTC 


CAGAAAGAGA 


TCACGCGCAA 


TGACAGAAGA 
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AACTGCGACA GACTAGTATT TGCCCTCAGC CTTTTCTTCT AAG 823 
(2) INFORMATION FOR SEQ ID NO: 23 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 512 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 236: 

CTTCTAAAAA GTCAGCATAC TGCTTTTCGC TAATATCCAT AAATTCAGGA TCATAGAATG 60 

AAAAGGCTTG GTCTAACTGG CTGTAGGAGT GTCATGGCCT TGGCATAGTA CTCTTGATAC 120 

TTAGCTTCAC GTGTATCCTG GTCATTCTTC ATATGAGCAT AAACGTAAAG CTTCTCCATC 180 

TGGCGTTCCA TTTCAAGAGA AAATTCAGTG ATTTCTAGTA GGTTATCCGC ACTATCCAAG 240 

AGATGGCCTT CATACTGGGC TACTGTCTCC AATTGTTCTG TTAAATCTTT TAAGGCTTCT 300 

TCCCAAGCCT GGTCAGTTGG GTAGATCGTT GATAGATCCC ATGTATCTTT TTCATTTATT 360 

TCATTTCTTT GTAATACCAT TAGATTCCTC CATCCTTTCT ATTCTACCAT ATTTTTCAAG 420 

AAATATTGCT GATAAAAGGC TGGTGGATAA AGCTTTTGCC AATCATTTTG AGGATTTTTT 480 

GGGTAATAAG GTATAAAAGT GCTGAATAAT AA 512 

(2) INFORMATION FOR SEQ ID NO: 237: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 544 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:237: 

CTAATAATAA CGATTTTATG ACCAAGATGG TCATGGTCTG ATTGAGTGAG CCCGCGATAG 60 

AGTCGAAGAG TCGAATGGCT GAAGCATTTG GGACAACGCT GGACACCGCA GATTGATTCA 120 

TAAGCAAGCC CGCCTGACTT TCGATACTAG TTGCACTGGT ATCCCTTAAT GAGACCAGAT 180 

AAGTGTTGGC TTGGGGTAGC TGTCCGTAAA GTTGCTCATA GCTAGCCTGA CTCATATAAA 240 

TAAAGTGACC AACGTAGTTC TCAGTAATAG CAGCGACCTT TAGTTCCTGA CCTTCAATTT 300 

ATA AAGTC TG CCCAACCTTG ACACCTGCCA GCTGGGCGAG TTTAGCTGTA ATAACGATGC 360 

CATCTTTTAA TGTCAGCTCC TGCTGATGAT GTTGAAGATG GATAAAGGGA GTCAAATCTT 420 
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■ CCTTCTCTAT CATCATAAGA GTAATGTTTT GAAGACCAGC CTTGCCTTTG AAATCCTTGT 480 
CTAGCGTTTT AGAATAGATT TTCTGGTAGG CTAGTATCTC CTGCCCTTTC AACACTTCTG 540 
CTAG 544 

(2) INFORMATION FOR SEQ ID NO: 238: 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 814 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 238: 

CTGGCTTTGT TTGATGAAAA AGATCAGTTT GTCCAAACAG TGACCATCGC TAGCCACCGT 60 

AAACAGAAGA ACTTTGACAT TATTAAATTC AAAGATATGT ACCATATCAA TACTATCGAA 120 

AAGTACAAGG GATACAGTCT CAAGGTCGCT GAGGAAGATT TGAATGACCT AGACGATGGT 180 

GAATTTTACT ATCACGAGAT TATCGGTTTG GAAGTCTATG AGGGTGATAG CTTGGTTGGA 240 

ACCATCAAGG AAAATCCTGC AACCAGGTGC TAATGATGTC TGGGTGGTCA AACGAAAAGG 300 

CAAACGTGAT TTGCTTTTAC CTTATATCCC ACCAGTGGTT CTCAATGTTG ATATTCCAAA 360 

TAAACGGGTC GATGTGGAAA TCTTAGAAGG GTTAGACGAT GAAGATTGAT ATTTTAACCC 420 

TCTTTCCAGA GATGTTTTCT CCACTGGAGC ACTCAATCGT TGGAAAGGCT CGAGAAAAAG 480 

GGCTCTTGGA TATCCAGTAT CATAATTTTC GAAAAAATGC TGAAAAGGCC CGTCAAGTTA 540 

GATGATGAAC CCTACAGAGG CGGTCAGGGC ATGTTGATCA GAGCACAACC TATTTTCAAT 600 

TCCTTAGATG CTATTGAAAA GGGAAATCCG CGCGATATTG TCCTCGATCC TGATGGAAAG 660 

CAGTTTGATC AGGCTTATGC TGAAGATTTG GCTCAAGAGG AAGAGCTAAT CTTTATCTGT 720 

GGGCACTATG AGGGTTATGA TGAGCGCATT AAGACCTTGG TAACAGATGA GATTTCCCTG 780 

GGCGAGTATG TCCTCACTGG TGGAGAATTG GCAG 814 

(2) INFORMATION FOR SEQ ID NO: 239: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 727 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 239: 
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CTATATTGTA NACTTTCTGA TCAAATTGAC TAGGTAAAAT CACCCCATAG GCTGTCAGAG 60 
TCCCACTTCC ACTTCCTGCG TAGCCCATAT CTCGCTGGGA GAGGATTTCA GCCGAATCCA 120 
CAAAACCAGT AATGGTATAA GTATGGTCTT TTAAAGAGGA ATGACCCTCT TCTTTTTCTT 180 
TAAAACTAAT CTCCTGTCCC ACGCTGTATT GGCCTTGCAA ATGAGTGGCC AAAGCGATTT 240 
CCTTGTCTGA CTGAGGAAGT CATCCCTTTC TTAGCTGAAA GGTTGAAATT CGCTCTGGTT 300 
TGGAGTACAG CCGAATGGCA TCCTGCCCAT TATCCATAGT CACATCTGTC AAATAGCCAA 360 
ACTCGACCTC TGCGCCCTCC GTCTGTTTTA GTTCTTCTTG GTATGCTTGA TCCAAGCCAT 420 
AGTTAGACAT GACTGCCAAA TCCAAGGTTT GAGCAGTTGT TAAATAAGCA TTAGCTGTCG 
TCTCCATGTT GGGACTGGTT ACTTTGAGGC CTACTAAGGC TAGAGATCCC AACATCATCA 
GGATCAAGAT GGATAAAAAA CGCCCCTTGG AGCCTGTGAA GGACTGAATT AAGTCCTTCC 
AATAAGTTTT TCGCTTGATC ATGCTAGTAC TCCAAACTGT CAATATCCTC AGGATGCTGG 
TTGAGCACCA CATCCTTGAC ACTGGCATCG TGCATATGAA TCACGCGATC AGCAATGGGC 
GCCAAAG 



480 
540 
600 
660 
720 
727 



(2) INFORMATION FOR SEQ ID NO: 240: 



(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 607 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 40: 

CTCAAAAAGC AGTAAAGAAA GAAGAAAACA GTCCTTTTGC TGGCTTACAA GGACTATTTG 60 

ACGGAGATGA ATAATGATCT TGTCAAAAAA ACGAGCACGA AAGGTGCTAG AAGAAATCAA 120 

TGCCCTATTC CAAGATGCCA AGCATAGTAA AGGTAATTTT ACCAATCATT AAGATACTCC 180 

TGGGAGCGGT AATGTAATCA GCCCAGAAGA CAGATGCAAC GGTAAATAAG ACCACACCAG 240 

GTCTCTTTGT TGTATATCCA ACACCCCAAA ATATGTATGT AGCGACAGAG AGAGAGATTG 300 

CTAAACATAT TTCTCGCTTG GGATGGTATC GGAATAAAGA TAAATTCCTT AAAAAATGTG 360 

CCCAACAGTT AGCTAAACGA TTTTGATGGT CAAGTCCCTC AGACACGTGA AGAATTGGAG 420 

AGTTTGGCAG GTGTTGGTCG CAAGACAGCC AATGTTGTCA TGAGTGTAGG ATTTGGGATT 480 

CCAGCCTTTG CAGTGGATAC TCATGTGGAG CGTATTTGCA AACACCACGA TATTGTCGAA 540 

AAATCAGCGA CGCCACTTGA GGTGGAAAAG CGGGTCATGG ATATCTTGCC GCCTGTTCAG 600 
TGGTTAG 



607 



(2) INFORMATION FOR SEQ ID NO: 2 41; 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 398 base pairs 

(B) TYPE: nucleic acid 
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(2) INFORMATION FOR SEQ ID NO: 243 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 420 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

252 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 241: 

CTCATAGTAT TCCTCTGTTT CTTCTTTAAT CCGGATAGGC AGTCAATCCT GAAAAGACAG 
AGGCACTCCC CCTTTTAAAT CTTTTTCCTT ACAATTCAGC AAGAATCGCT TTAAGCAAGC 
CTTTGAAATC ACCTTTAACA CGTTCAGTCA CTTCTACAAC TTCTTCGTGA TTGAGTTCTT 
CTTGGAAACC AGCCGCAAAG TTAGTGATAC ATGAAATTCC CAGAACTTTC AAGCCAGAGT 
GGGCTGCCAC GATAACTTCA GGAACCGTAG ACATACCAAC TGCATCTGCT CCCAGTGTCT 
TATAGGGAAC GAATTTCTGC TGGTGTTTCA TAAGTCGGAC CAGTTAACTC CGATATAGAC 
ACCTTTCCAT CAAAGCTTGA ATATTAAGTT TTTTTAGA 

(2) INFORMATION FOR SEQ ID NO: 2 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 533 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 242: 

CAGTTCAACT GGTTGCAAAT GCAAGAGCCA AAACCCGCAA ATCCTTCTGG GCAGTTCTAA 60 

AATAGCTGGG ACACTCTGCA ACTGGATCTG CATCTTTTCT CTATCCAAAT TTCAAAGCCT 120 

GCTGACTGAA ACCCGATAAA ATCATGTGGG CCGCAAACAA CTCCTGACTG ACTGTATCGT 180 

GCAAATCCCG AGCAATTCGC TTCCGTTCTT TCTCGATGAT TTCCTCTTCA TGAGCAAGGC 240 

TATGATTTTC AGCTTTTTGA AGAGCTTCTG TCAAAAGGTT AAGTTTACCT GATAAGGACT 300 

TGAAACTGGC ATCCAAATCT GGATCTGCAA CCTGAACCAC TTCTTGCCCT GCCAATAAAC 360 
GCTTGAGATT AGCCTGCATT TTTCTTAGAG AAAGCTCTTC GATCCCTCGC CAAAACAGGG 
CTAAGAGACA GGTTATGGAC ATGCTGAAAA CCAACAATAA AAAGACAAAT TTTTCTGTTT 
TTTCGACATC GTGCAAAAAG ATAGACCAGT CAAAATCAAG TATTTCCAGC AAG 



420 
480 
533 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 243: 

CTACCAAGAG AGCCAGAATA CCCATAGAAT AGCTATAAGG TTTCAGTCAG AAAGGCTACA 60 

ACTTGCATCA GACCATTTAA AGCCCCATGA GTTTGGTACA AAGGCAATCA AGATAAAGAT 120 

ACTTGAGAAG AGAATAACAG GCATACCTGC AATGAAACCA TCACGAATAG CACGAAGATA 180 

GATATTACGA GATAGTTTTT CAAAGAAAGG CTTTCCTTTC TCGATAAATG CAATTAGTTT 240 

ATTCATTATT TAACTCCTCT CTTGTAGAGT TCAATTAAAT GATGCATCAA ATCTTTTAAC 300 

AAGATAGTTG TCATTAAGTG GTCTTGGCCA TGCATCATGG TTACACTATA AGCCAAGTCC 360 

TCACCTGAAG CTTCCTTAGT CAATACACTT GTTTGCGCGT GGTGAGCCTC TGCAATACAG 420 

(2) INFORMATION FOR SEQ ID NO: 244: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 513 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 244: 

CTGTCTTTGA AAAGGAAGGT AAACTGCTTG AAGCCCAGCG TTTGAAACAG CGGACAGAGT 60 

ATGATATCGA AATGTTGCGT GAGATGGGCT ATACCAATGG GG TTGAAAAT TATTCTCGCC 120 

AACATGGATG GACGGAGCGA AGGAGAGCCT CCTTATACGC TTCTCGACTT CTTCCCAGAT 180 

GATTTCTTGA TTATGATTGA CGAGAGTCAT ATGACCATAG GGCAAATCAA GGGCATGTAC 240 

AATGGAGACC GTTCGCGTAA AGAAATGCTG GTTAATTATG GTTTCCGTTT GCCGTCTGCT 300 

TTGGACAATC GTCCTCTCCG TCGGGAGGAG TTTTGAGAGT TCACGTTCAT CAGATTGTTT 360 

ACGTTTTCAG CGACACCTGG TGACTTATGA AAATGAACAG ACCGAGACAG TGATTGAGCA 420 

AATCATCCGT CCAACGGGAC TCTTGGATCC AGAGGTGGAA GTCCGTCCGA CTATGGGACA 480 

GATTGATGAC CTCTTGGGTG AAATCAATGC CCG 513 

(2) INFORMATION FOR SEQ ID NO:245: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 760 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 245: 

TCGCAGACAA ACCAAGGATG ATATGAANCC ATTCAATCTC TATGAAGCAA GTTGAAGCCA 
AG6AAGAAGA GCAAGAGCAG GCTGAACGAG AAGCTGAGAG TAAGAAAGAA CCTTACATCT 
ACTACATCCT TTCTTTTGCT AAGTTGGCTG ACTTGGTAGC TTTTGCCAAG ACAGTGACTT 
TTGAAATGGA AACTTCTGAA CTCTACAAAA TGAACGAGCG CTATTATGGT CTCTTTCACC 
GATTGTAGTG GACTATTGTA AATCATCCAA CGGCCCATAT CCAGCTTGGC TCTGATGGCC 
CGAAATGCGC GAGTTTGCAG ACGATAGTGA TATCAGTCGC TCAGTCTTAC AACAGTATGG 
TCAAGTCTTG ATGAGTCACG ATGCAGTGCT CAATCTGCAA AAATCGGCTA ATCCTTTCTG 
GAAAGCTNTN TTNNNATTTT AANANNAGCG AATCGTCTGA TCCNTTTTTT CTTTTTNTAC 
TGATATAGTG ATTTACTATN ATANGAATTT TCACAANNTT CTGTTATAAT GGCTATATTN ^ 
AAAANTTTCG ANGAGATAAN TATNACNNTT CANATTGCTT TNCTTTGATT TGGTTNNNTT 600 
GCAANTGGTG TGCCTTNCCT CCNTNNANGG ANAATGGAGG ACTANTCATT NCCNTCCGCN 660 
CNTTCANATA CCCAANTTGC TNTTGTATTN GTCCCNGATC ATCATNACTT CNCTCTCTTG 
CTCNCTCCNC GGGAATGATT TTTNCTTTGT TCCCATCGTT 



60 
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(2) INFORMATION FOR SEQ ID NO: 246: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 452 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 246: 

CTGGTGAAAA AGAATTCGCG CTTGGACTTT CAAACGTGGT ATGAAGGCTC CTCAAGCAGC 60 

TGGTATTATC CACTCAGACT TTGAAAAAGG CTTTATTCGT GCAGTAACCA TGTCATATGA 120 

AGATCTAGTG AAATACGGAT CTGAAAAGGC CGTAAAAAAA GCTGGACGCT TGCGTGAAGA 180 

AGGAAAAGAA TATATCGTTC AAGATGGCGA TATCATGGAA TTCCGCTTTA ATGTCTAAAA 240 

ATTAATAAAT GGTGTCAATT AGGTTGGAAA AAAATTCCAA CCCTTTTGGC TTTTGAAAGG 300 

AAAAATAAAT GACCAAATTA CTTGTAGGCT TGGGAAATCC AGGGGATAAA TATTTTGAAA 360 

CAAAACACAA TGTTGGTTTT ATGTTGATTG ATCAACTAGC GAAGAAACAG AATGTCACTT 420 
TTACACACGA TAAGAATATT TCCAAGAATT CG 



452 



(2) INFORMATION FOR SEQ ID NO: 247: 



(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 447 base pairs 

(B) TYPE; nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 247: 

ACTTTTCCCA CAACCAGAGG GCCCTAGGAT GACAACAACT TCCCTTTTTT GATTTCTAGA 
GAAAGTCCTT GGAGGATGGG ATTGTCTCCG AAGGATTTGT TTAGTTCCTT GATTTCTAAG 
ATAGTTTCAG ACATTTAGTT GCTCCAATGT TTTTCTAAGT GAGTGGATAG TTTGGAAATA 
GGGTAGCAAA CTGCGAAATA TAAGACTAGA ATGGTTCCAT AAACCCAAAA TGAACCAGTT 
GGGATGGTCA GGCGATTGCT ATCGATGATT TGTTGTCCAA CTTTGGTCAC TTCCACAACC 
CCAATCAAAA CAACTAATGA AGTGGTTTTA ATCATCCGAG TGACAAGATT GATAGCCTGC 
GGTACCAGTC TTCTTAAGAC TTGTGGGATG ATGATGTGGT ACTAAAGTNG AACATTAGTC 
AAGCCGAGTT GCCTGTCCAC TTTTCAA 

(2) INFORMATION FOR SEQ ID NO:248: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 433 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 248: 

ATAAGGGAAA ^CACGCCC GTAAAGGTCG CATCAAAGCG 

ZI^ CC G * CTCAGGAG AGGCGCCATT GTGCAATTCT AACAAAATAT CCCGTAGGAC 
ATGAATCCGT TCATCTGTCA TTAGTCTAAT CCAATCACTT CGTAGCCATT CGCTTCCAGT 
GTGCGTACAA TCTTGTCCAT AGGAGTTCCT GCTAGCTTAG AACCCTGTTT AAGTGATACT 
ZZ^l ™™™ CG CATTAAGGGA TTGGCAAGGG GTTTAAAACC CAACTCCACT 
AGAATTTCCA AGACTTCTGG ATGCTTGTCC ACCACTTCTG CAACAGGAAT TGACACATCG 
CAACTAGAAC ippg^*'^**^ CTCCATTGTA TGTTCTAATA CTCAATGAAA ATCAAAGAAC „ D 

433 

(2) INFORMATION FOR SEQ ID NO:249: 
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(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 511 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 249: 

CTTGTCATGT TTTAGCTGGA AATTTGAAAA AGGTAGCTTC TACCATGTAT TTGTTGACAC 
CAGTGAATGT TATTGTAAAT GTTGAAGATA TCCGTTTACC AGATGAAGAT CAACAGGGTG 
AGTTCGGTTT TGATATGAAG CGAAATAGAG TACGATAATG ATTTTTTTAA TTCGTATGAT 
TTATAATGCA GTGGATATTT ACTCCCTGAT TTTGGTAGCC TTCGCTGTCA TGTCTTGGTT 
TCCAGGTGCC TACGAATCCA GTTTAGGTCG TTGGATTGTA GCGTTGGTGA AACCAGTGCT 
TGCTCCCTTG CAACGCCTGC CTTTACAGAT AGCGGGTCCT TGATTTATCT ATTTGGGTTC 
GCGAATTGTT TTGGTTCGAT TTTTAGGAAA AAAACCTAGT ATGCGTTTTC CTGGCGAATG 
AATAGGAATG AATAAAGGAT TTATCCAGCA TTTCTCCCAT AGAAGAATTC GTCCATTTCC 
TTGACAAGGG AAATGGAATG GATAAAGAAG G 

(2) INFORMATION FOR SEQ ID NO: 250: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 588 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 250: 

CTCTTCAATA CGGTCAATCA ACTTCCATGA CGCACCAACC TCATCCCAGT GGCTAAAGTT 
AGTTGAGTTG TTATTTAGGA CATCATAAAT CAATTTCTCG TATGGTTCTG GAGAAGCACC 
AGTTGCAGTC GCGTCTGTAC GATAATCAAG TGAGTTAGGA GCCAAGTTAA ATTCTTCTCC 
TACTTGCTTC CCATTTAGGC TAAGAGAGAA GCCTTCTGTT GGTTGAATAT AGATGGTCAA 
AATATTTGGA GCAAGTGGTT CTCCAAATAT AGAATCCATT TGTTTAAAGA CGATGTTGAC 
ATGACGTTTT TTCCTTTTTC AGTCAGTCGT TTACCTGTAC GGAAAAAGAA AGGATCTAAC 
ACCACGGAAT CGATCGCTGT CTACAAAGAA GGCACCAGAT GTAAAGGTTT CAGTTGTTGA 
TTCTGGATTC ACATTTGGTT CGCTACGATA AGAGATGTAT TTCATGCCAT CAATCTTACC 
AGAGCGGTAT TGTCCACGGA TAAAGTGTTC TTTGAGTTCT TCATCAGTTG GATGATAGAA 
GTTTTTAAAG ACTTAATCTT TTCAGCACAA ATCTCGTCTT TTGTGAAG 
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(2) INFORMATION FOR SEQ 



ID NO: 251: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 596 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:251: 

GANAACACAA CCAGAGTCAG CAATAATCTC CANGNCATAG AAATACTCCA TCTTTGTCAG 
TCATACATCG ACNAAGGTTC TGTCTCTCCN ACAATTATGT CATGAAAACC CTTGAAACAA 
AAGCATCATC CGTCTGTGGT GACAAGTAGA AAGGAAAATC TCATGTCTGA ACGTAGAATC 
TCTCAAAAGT CTCTCGAAAA TCTCAAAAAA TCAAACCAAG AATCCNATNT ATTAACCAAA 
GAAGCCAGGG ACACACGCCC TCGGGCAACT ATTGGAAAAA AAGGAAATGA CCAAGATTAG 
TATTTATGAA TTGGTCAAAC GAGCAGGTGT TTCGCGTGCG GCCTTTTATC GCAATTATGA 
CTCCAAAGAG GAAATTTCAG AGAGCGTATT TAAACGAACT GTCCACAATA TTATGGAACA 
GATGCATCAT TACGATTTAA AGACAGACCT TTATGTGGTT TGCTTCACCT TTTCCGGGAG 
GCCAGAAAGG AAGCTAGAGT AATTCAATTG GCCTTGGATT ACCATCTGGC AAAAATCTTT 
GTCCAAGCCA TGCAGGAATT TATAGAAAAA TACCATGGGA AATCGAAAGG CGTCAG 

(2) INFORHAT ION FOR SEQ ID NO: 2 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 521 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 52: 

CTTGTCATGA ACATTTGAAA CTGGGTCGCT TGTGATTTGA ATCTGAAGCA AAGGCAAGTG 
GCAATTCTGA ACCGGTCCAT TGGTCAGAAA TGTTTGCACC TTGCTCAGTT TGAGCAGATA 
CGCGAACATG AAGTTTAGTT GTTAATTGAG TACCTTCTAA GCGACCATTA ACTGTAAAGA 
CACCTTCCTT AGCGTATTGC TCTGGACGAA TCGCATCCCA TGCAACCTTA GCTGATGAAA 
CGTGACCATT TGAATCATAT GTCCGAACAC TTTCTGGTAA TTGTGGTGCT TCTGCGATTG 
GAGTTGTCAC ACTGACTTCT TCAACTGAAA CGATACCGTC TACAGAGACT TI^CACGCG 
CTTCAAGGTC AATTCCTTCA ACTTTACCTA GTACTTCAAA TGTGTGATAG GAGTCTACTT 
TGTCTTTCGG AATAGCTTGC CAAGTGACTT TATGAGTTTT AGGGAAACCT TTGTCATACT 
CAACTGTTAC TGTTGCTGGA AGACTTGGTT CCTGATGCAA A 

257 
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(2) INFORMATION FOR SEQ ID NO:253 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 541 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 253: 



TAGAATGATG CTCTGCTAGC AGGATTATTC ACGCTACAGG TCACCAGAGC TTTCTTGATG 60 

TTCTTTTCCT TAGCAACTTG CAAGCCCTGA CGGAGAGTTC TTTTGCATAA CCCTTGCCTC 120 

TTTCAGATGG ACGGATGGAG TAGCCAATGT GCCCACCTTC TTCTAGTAGA AAGTTACTGA 180 

GGCGC AACCG GAGATTAAGA AATCCAACTG CTTGACCTTT CTCAGAAAAA GCCACTAACT 240 

GAATTGCAGG AACCCATCCT TCAGGCAGAT TAATCCCCAT TTCCTGTTCC TGATTGCTTT 300 

CCAACCACTC TTCATACACA AAATTCTCTG TATCCCAAAA TCCTCCATCA TGGGCTGATT 360 

GGCTCTTTTC AAACTCTGTC ATCATATCTA AAACAGCTTT CTTATCCGCT AATCTTGGTC 420 

TGCGTAATTC CATCCTTACC TCCCACTATG AGAAAAGCGA GAGCTGGCTC TCACTTTTAT 480 

TTTTTCTTGT TCTTGTTTAA AAATTGTTCT CTGAAGCTAA GCAAGCGTTC TTTCTTACCA 540 



(2) INFORMATION FOR SEQ ID NO: 2 54: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 486 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 54: 



CCTGCTCCTT 


ATCTTTGCAG TAATTGGCGT 


TTTAGTGGCC 


TTGATAGCTC 


AATTTTACTC 


60 


AGCAAAGGCA 


GCAGTAGGTT TTGCTAAGGA 


ATTGACAAAC 


GATCTTTATC 


GTCATATTCT 


120 


TTCCTTGCCC 


AAGGACAGCA GAGACCGTCT 


GACAACTTCT 


AGTTTGGTTA 


CTCGCTTGAC 


180 


TTCGGATACC 


TACCAGATTC AGACTGGTAT 


CAATCAATTC 


CTGCTGTCTC 


TTTTTACGAG 


240 


CGCCCATATA 


TCGTTTTTGG TGCCATTTTT 


ATGGCTTATC 


GAATCTCAGC 


TGAGTTGACT 


300 


TTCTGGTTCT 


TAATCATGGT TGCCATTTTG 


ACCATTGTCA 


TTGTTAGGGT 


TATCTCGATT 


360 


GGTCAATCCT 


CTCTACAGTA GTCTCAGAAA 


GAAAACCGAA 


CCAACTGGGT 


TCAGGAAACC 


420 


CCCCACCAAT TGCAAGGGAT GCCGGGTTAT 


TCCGTGCCTT 


TGGTCAAGAA 


AAACGATAGT 


480 
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TACAGC 



486 



(2) INFORMATION FOR SEQ ID NO: 255: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 581 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 255: 

CTACTCAATG ACGACCAGCA GTCATAACAT CAGCAACACT AATTCCCTTG CCACCTTCTT 
GCCATCCACC TTCAAGTTGA TGAGCAGCAA CAGCACCACC CCATAAAAAT CTATCTTTAA 

AAGTAGTCAT CTTTTTTCCT CCTGACTTTG ATACTCTTAT TATAAACCTC AAACCAAAAG 180 

ATGAAAACGC ATTCTTTTTC CTTATTGTTA AGGAAAGAAG TAATTTTTAA TGGGAAATAG 240 

AACAATATCT TCTTGTATTC TCGTAATGAT ATCTTTACGA TTTTCAATAC TTCCAAACTA 300 

CAAAAACTCT CACAATAATT CTAATTCCCT GTGTCTATAA ACGACTTATC GCTTTCTGGC 360 

ATCCCAGAAT CATCCTCTAT ATAACGTTCA ACTTGCATCT TGGCAAGTGA TATTTTTTTC 420 

TTAAATCTAA GATTTTCTGC ATTGTCTTTG ATTGATGATG CTTATCTAAA GCTTCTTGAT 480 

TTATCCACTG ATCAATAAGG AGAATAGTTC CCTCTTTTTC AATTGGTAAA AAATATTCGT 540 
ATTTCAAGTT ACCTTTTTGA TTTCTAATTT CTTCAACAAG G 



(2) INFORMATION FOR SEQ ID NO: 256: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 417 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 256: 

ACTCCTTTGT AATTTTTTAC TTTTCTTCTT TTATGTGTTT AAGATGATCT GGATGGTCAA 
TCTCTAAATC AAAAATCTCT GGAATAGAAC TGTAGTGGAT AATGCACTTG ATACCCAACT 
GATTCATTTT TTGTATGAAA GAAGTATTCA GATAGCCTGC TACAGCAAAA TCAATCTTGT 
TCTTTCTTGC TTTATCCTGC ATATCTCTTA GCATATCTAA CATTATTGGA CTTTCCATAT 
CATGCCATTG ACTGTTTCTC ATAGTCGCAA AAACAAAGGA AGTCAAATCA GTCACTCCAA 
CTACAATCTT TGAAATGCCC GTTTCCAGTA TACTAGATAA GTCAAAATAC GCTGACGGTA 

259 
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ATTCAATCAT CGTTCCGACT TTCCCAGTAA AACCCTGCTG AGGCAATACT GTAATAA 
(2) INFORMATION FOR .SEQ ID NO: 257: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 448 base pairs 

(B) TYPE: nucleic acid 
<<C) STRANDEDNESS : single 

CD) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:257: 

CAATCCGATC CCTGAACTCA CTCCTGTAAT GAGAAAACGT TTAGTCATGC AATTCTAACC 
CAATCCGTTG CCAAAACATC ACAAACTGTC GGGCTCCAAC ATGGGAAAAA CCTTCTCCTT 
CGCCAGAAAC GTTGATTAGG AAATAAGGTG TCATTTCAAG TGCAAGCCCA TTTTGCTCGA 
TGGTATCAAA GAGTTGGACA TAGTTTTCCG CACCTCCCCA ACCAGTTCGT ACATATTTCC 240 
TCTTAGCCTT TAACCCAGGC AGGATCTCTT CAAATGTCAT GTTTTCCTCC TTTAATTCTA 300 
CATTCTTCAT TTAATTATAG CAAAAAACCA CTTTATACGG CTTTTTGGAC TGTGAGTTAT 360 
TCAAACCTGG CTACTTACTT ACGGGCAAAT TATTCCCTGC AGGCAAGATT AAATTTCATA 420 
CCATTCTTTT CTTGTTAAGC TAAAGTTT 

448 

(2) INFORMATION FOR SEQ ID NO: 258: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 679 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 258: 



60 
120 
180 



60 
120 



CTACTTGTAT TGCTGGAAAT GGTATAAGAT GAAAATCTAC AAGTCACTAA TTTTCATATA 

ACCAAAAAAC GAACAGGCTC TTTTTCTGTT ATTCAGAAGA GAACTTGTTC GTTTTTATCT ... 

TGATTTTCTT AAAAAGCTCT ATATACATAG GGATTTTCTG GTTTATCGAC TTTGGTAAAG 180 

CTGTCGATTT CCAAGGTTGG AAGGTTTTGT TTGAGTTCTT TATGGTAGTG ATTGACCAGT 240 

TTTCCTTTGG CTGAAATCCA AGTGTTGTTT TCATACTGCC GGGTATTGCC CTTGGTCAGC 300 

AATCCATAGA CACCAGAATC TGCGATGCAG TGAATAATGC CGAAGCGGAA CAAAAATTGG 360 

CTATTGGCAT GACTGGGGTC ATTATAGACA AAGCCTGTTG AACTGGATTG TCTTAACCCT 420 
CAAATTCATC TGGGATAGTC GTAGATAGCC TCCATGACCT CCCATATAGT TTTCATTAGT 

260 
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GATCTGAATA CTATCTTGGG ATAAGTATTT ATCCGCCGCC GTTCGCATTT CCTTTTCATA 
GGCTGATTTT GAAAAATAAG AACTGGTATC TGGTTTCAAA TATTGGCTTG TCGTCCCTTC 
GCTTGTNTGA ATGGCTAGAT CCGTTCCTTC CGATAGGGGG AAATGATAAC CTTTAGCAGA 
AACGGTCTGA GAATCCAAG 

(2) INFORMATION FOR SEQ ID NO: 259: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 521 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:259: 

CTGCTATCAA GACAGTATTA CCGCTGAAAG AAACAGCTTC ACTAAAAAGG CGCAATTGAA 60 

AAATAAGACG GGATCACGAA CTCCCGTTTT TCTATAAAAG AAAGGAAATG GGATGAAAAA 120 

ATTAGTCTTT GTCTGTCTGG GAAATATTTG CCGTACCCCT ATGGCCGAGT TTGTTATGAA 180 

ATCAATGACA GATAACTACG AAATCCAAAG TCGAGCAACT TCCTCTTGGG AACATGGCAA 240 

TCCGATTCAT AAGGGGACTC AGGGAATTTT TCAAGAGTAT GAGATTCCTT ATGACAAGAA 300 

CAAGACATCG CTTCAGATTA GTAAGGAAGA TTTTGAAGCC TTTGATTATA TTATCGGAAT 360 

GGACGCTTCA AATGTTTCCG ACTTACGTCA GATGTGTCCA GTACACTGTC AAGATAAGAT 420 

TTACTCATTT TCATCTGAAA GTGTTCCAGA ACCTTGGTAT ACAGGGAGAT TTTGAAGAAA 480 
CCTATCGACG TGTTCAAAGA AGGCTGTCAA CTTGGTTAGA A 



521 



(2) INFORMATION FOR SEQ ID NO: 260 : 



<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 515 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 260: 

CTAAACGAAT TTCCATCTTT TTTCCTCATT TCTGACTAGT TTCTCCCATC ATAGCATAAT 
TTAAGTATTT CTACAAGCAG TTAAAACTTG ACTTTGTGTT TTTGTTATTT TATAATCTAG 
TTATCAAAAC TATATAAAAA GGAGTTGAAA ATGAAAACTA CCTTTTCCTA CCCAAAATGG 
GCAGAAATTC CAAACATTGA CCTCTATCTG GACCAGGTTT TGCTCTATGT CAATCAGGTC 

261 



(2) INFORMATION FOR SEQ ID NO: 262: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 571 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE; Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:262: 

TAACTCTGCT ATAATGGATT TATTCCTTTT TGTGTTTACA CAATTTATTT TATACTACCA 
AAAAAACAAG GTCAGGATTT TGTTCCTGAC CTTTGACAAC TTTACCGATT CTTTAGTTCT 
ACATAGCGCT TGTACCAAAT GTTTACATAG GCTTCTGAGA AAGGACCACG TCCATTGTTA 
ATCCAATCAA CAAGAATTTT GACATGTTCT TTTAAAATAT AGTCCAAGTC ATCAGAATAA 

262 
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TGCGCCCCTA TCTCTCCTAA TAAAGACAAG GGCCTAACAG CATCTATGGT CAACAATTAT 
GTGAAAAATG GTTACCTGAC AAAGCCTGAC AAAAAAAAAT ACCAACGCCA ACAGATTGCC 
CCGTTTGATT GCTATCACAA CCCTCAAGTT CTGTATTTTC CTATTCCAAG AAAATAGCCC 
CAGACACTTA ATACTCCTAC AAACTCCAAG CCAAGTTTCC AGAACCAACT CCTACGAAAG 
CCTTTGTTGG AACTATATGA ACCCAGAATT GACCC 

(2) INFORMATION FOR SEQ ID NO: 261: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 452 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 261: 

CTTGTTGGAT ACAATCTTTG AAGAAACACA AAAGGGAGAT GACTAAGATG AAACGATGGA 

TAGCATTAAA TAAGATAGAA TTTCTATTGA CCAAACGGCA ATCAGTCTAT TATTTATTAT A „ 

CAGTAGGGAT GCCGACAGCC TTCTATCTAT TCTTTTCTAG CATGTACCAG GATACTCCAG 180 

GTGGACCCGC GAATTTTATG CGTGATTATC TTATCTCCAT GACTGCCTTT TCTATGATGT 240 

CGACAGCTAT CTTCTCATTT CCAGTTGTTT TACATACCGA CAAGATGAGC AACTGGCAGA 300 

AAACATTACG TCATAGCCCT GTTAATATGG TAGAATATTA CCTATCAAAG ATAACAGTAT 360 

GCTGGGTTGA TTATTTGGTT TTCCATTCTT GGTGGTTTTT CCTCAGTTTG GGCTTTTTGT 420 
TAGAAGGTGT TGGACTTGTT CTCTAACAAA GA 
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TTCATTTTGC GTTTGTGACG CTCGTACTCT 
GTAAAAATTT TAACATCCAA ATCGTAATCA 
TAGGGGCTAG CCATATTGCA ATAGTAAGAA 
TAAACCAATA TTTCTTGTGA AGTAAACAAT 
TCACTTTCGG TAACAATGTA TGATCGTTGA 
CCATTGGTGT CCGCCAAGTC CGTGGAAATC 

(2) INFORMATION FOR SEQ 




PCT/US97/07950 

TCAACGTCCA AGAGACGTTT TTCCCCATCT 300 

ATATACTTCA GTGCTTCTTC ATCCAGATAG 360 

GTCCATTATC ACGAATCATG GCAATGATAT 420 

AGCCGGTTCT CGAGTGACCA ACGACGAACA 480 

CCCAATAATG GCGTTTCTGT TGTTTTAATA 540 



ID NO:263: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 162 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 63: 



Met Glu Lys Cys Tyr Thr Asp Val Thr Glu Phe Ala He Pro Ala Ser 

1 5 io 15 

Thr Gin Lys Leu Tyr Leu Ser Pro Val Leu Asp Gly Phe Asn Ser Glu 

20 25 30 

He He Ala Tyr Asn Leu Ser Thr Ser Pro Asn Leu Glu Gin Val Gin 

35 40 45 

Thr Met Leu Glu Gin Ala Phe Thr Glu Lys His Tyr Glu Asn Thr He 

50 55 60 

Leu His Ser Asp Gin Gly Trp Gin Tyr Gin His Asp Ser Tyr His Arg 

65 70 75 80 

Phe Leu Glu Ser Lys Gly lie Gin Ala Ser Met Ser Arg Lys Gly Asn 

85 go 95 

Ser Gin Asp Asn Gly Met Met Glu Ser Phe Phe Gly He Leu Lys Ser 

100 105 no 

Glu Met Phe Tyr Gly Tyr Glu Lys Ser Phe Gin Ser Leu Lys Gin Leu 

115 120 125 

Glu Gin Ala He He Asp Tyr He Asp Tyr Tyr Asn Asn Lys Arg He 

130 135 140 

Lys Val Lys Leu Lys Gly Leu Ser Pro Val Gin Tyr Arg Thr Lys Ser 

145 150 155 ifiO 

Phe Gly 



(2) INFORMATION FOR SEQ ID NO:264: 
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Thr Pro Pro Phe Glu Ala Thr He Lys Asp Gly Trp Val Ser Asp Val 

100 105 HO 

Val Ser Lys Met lie Lys Ala Leu Arg Ser Gin Leu Ser Met Gin 



115 120 

120 125 

(2) INFORMATION FOR SEQ ID NO: 266 : 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 35 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

<ii> MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:266: 

Met Ala Glu He Gly Ala His Ala oin Asn Gly G l u Arg Val Leu Thr 

10 

Gly Asp Glu Leu He Asd Phe v=i » 

xie Asp Phe val Asn Asn Lys Leu Phe Lys Glu Leu 



Lys Gly Ala 
35 



20 25 

" 30 



(2) INFORMATION FOR SEQ ID NO: 267: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 280 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
(DJ TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO:267: 
Met Trp Met Ala Ala Phe Phe Gly Met Ala Thr Lys Tyr Ala Glu Gl y 
- Leu Ala lie Lys Tyr Arg Thr Lys £ Asp His Gly ^ » ^ 
«y «V *o Met His ^ m Leu 1 Gly Met Gly Glu Lys Trp Arg 



40 

45 



265 
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Pro leu Ala Val Leu Phe Ala Val Ala Gly Val Leu Val Ala Leu Leu 

50 55 60 

Gly lie Gly Thr Phe Thr Gin Val Asn Ser lie Thr Glu Ser lie Gin 
65 70 75 ao 

Asn Thr Thr Thr He Ser Pro Ala He Thr Ala Leu Val Leu Ser Val 

85 90 95 

Phe Val Ala lie Ala Val Phe Gly Gly Leu Lys Ser lie Ser Lys Val 

100 "5 110 

Ser Thr Thr Val Val Pro Phe Met Ala lie He Tyr lie Leu Gly Thr 

115 120 125 

Leu Thr Val lie Phe Phe Asn He Gly Lys He Pro Gly Thr He Ala 
130 135 u 0 



Leu Val Phe Thr Ser Ala Phe Ser Pro Leu Ala Ala Val Gly Gly Phe 
145 "0 1S5 

Ala Gly Ala Ser Val Arg Met Ala lie Gin Asn Gly Val Ala Arg Gly 
Val Phe Ser Asn Glu Ser Gly Leu Gly Ser Ala Pro He Ala Ala Ala 

Ser Met Thr 



145 

Arg Met Ala He Gin Asn Gly Val Ala Arg 
165 ™ 175 

Glu Ser Gly Leu Gly Ser Ala Pro He Ala 
IBS 190 
Ala Ala Lys Thr Asn Glu Pro Val Glu Gin Gly Leu He 
195 200 205 

Gly Thr Phe He Asp Thr Leu He He Cys Thr Leu Thr Gly Leu Thr 

210 215 220 

He Leu Val Thr Gly Val Trp Ser Gly Asp Leu Asn Gly Val Ala Leu 
2 " 230 235 240 

Thr Gin Ser Ala Phe Ser Thr Val Phe Ser His Phe Gly Pro Ala Leu 

245 250 255 

Leu Thr He Phe Leu Val Leu Phe Ala Phe Thr Thr He Leu Gly 



260 265 



Trp 

270 

Asn Tyr Tyr Gly Arg Thr Leu Phe 
275 280 



(2) INFORMATION FOR SEQ ID NO:268: 

<i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:268: 
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Met Val Gin Val Gly Thr Thr Leu His Lys Glu Gly Val Ser Ala Phe 

1 5 io 15 

Asp Arg Phe Thr Asn Glu Leu Lys Ala He Met Val Glu Lys Gly Tyr 

20 25 30 

Glu Ser Leu Glu Asp Phe Arg Gly Lys Leu Arg Tyr lie Asp 
35 40 45 

(2) INFORMATION FOR SEQ ID NO: 269: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 210 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:269: 

Met Lys Lys Tyr Phe He Gly Gly Leu Gly Ser Asn Ala Tyr His Ser 

1 5 10 15 

Lys Asp Phe Leu Gin Glu Leu Asp Ser Gin Val Tyr Phe Leu Asn Pro 

2° 25 30 

Tyr Glu Lys His Leu Arg Asp Glu Thr Glu Leu Lys Ser Trp Phe Lys 

35 40 45 

Asn Glu lie val Glu Glu Glu Ser lie Cys Leu He Gly His Ser lie 

50 55 60 

Gly Gly Asp Leu Ala Arg Tyr Phe Ala Ser Glu Phe Glu Glu Val Lys 
" 70 75 80 

Lys Leu He Leu Leu Asp Gly Gly Tyr Leu Asp Leu Asp Lys He Leu 

85 90 95 

Pro Leu Asp Thr Glu Leu Glu Glu Thr Lys Asn Tyr He Lys Ser Gin 

100 105 no 

He lie Ser Asp Leu Asp Val Leu Thr Ser Lys Glu Lys Ser Glu Ala 

115 120 12 5 

Lys His Trp Ser Glu Asn Met Glu Lys Ala Val Arg Gin Ser Tyr His 
"0 135 14Q 

Trp Asn Val Glu Tyr Asn Arg Tyr Glu Leu Ala He Asn Tyr Glu Asn 
145 150 155 160 

He Glu Ala He Leu Arg Leu Arg Arg Lys He Gin Ala Phe Lys Arg 

165 170 175 

Glu Val Gly Asp Thr Leu Phe He Ser Pro Arg Tyr Pro Asn Glu Ala 
180 185 190 

267 
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Thr Trp Arg Glu Glu Ala Leu Lys Glu Leu Pro Asp Tyr Phe Asp Thr 

195 200 205 

lie Phe 



210 

(2) INFORMATION FOR SEQ ID NO: 270: 

(1) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 186 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 270 : 
Met Lys Leu Glu His Lys Asn lie Phe lie Thr Gly Ser Ser Arg Gly 
He Gly Leu Ala He Ala His Lys Phe Ala Gin Ala Gly Al 



20 25 



a Asn lie 

30 

Val Leu Asn Ser Arg Gly Ala lie Ser Glu Glu Leu Leu Ala Glu Phe 

35 40 45 

Ser Asn Tyr Gly lie Lys Val Val Pro lie Ser Gly Asp Val Ser Asp 

50 55 60 

Phe Ala Asp Ala Lys Arg Met lie Asp Gin Ala lie Ala Glu Leu Gly 
" 70 75 80 

Ser Val Asp Val Leu Val Asn Asn Ala Gly He Thr Gin Asp Thr Leu 

85 90 95 

Met Leu Lys Met Thr Glu Ala Asp Phe Glu Lys Val Leu Lys Val 



100 105 



Asn 



110 

Leu Thr Gly Ala Phe Asn Met Thr Gin Ser Val Leu Lys Pro Met Met 



US 120 



125 

Lys Ala Arg Glu Gly Ala lie He Asn Met Ser Ser Val Val Gly Leu 

130 "5 140 

Met Gly Asn lie Gly Gin Ala Asn Tyr Ala Ala Ser Lys Ala Gly Leu 

" 5 150 155 160 

He Gly Phe Thr Lys Ser Val Ala Arg Glu Val Ala Ser Arg Asn lie 

I 65 170 175 

Arg Val Asn Val He Ala Pro Gly Asn Asp 
180 185 



(2) INFORMATION FOR SEQ ID NO: 271: 
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<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 113 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 271: 

Met Val Met Gin Arg Leu Leu Leu Lys He Phe Pro Thr Asp Asn Asn 

15 10 15 

Val Val Trp Lys Leu Phe Val Pro Leu Leu Arg Asn Pro Lys He Leu 

20 25 30 

Phe Leu Asp Glu Pro Ala Ala Gly Met Asn Pro Gin Glu Thr Ala Glu 

35 40 45 

Leu Thr Glu Leu He Arg Arg He Lys Asp Glu Phe Lys lie Thr He 

50 55 60 

Met Leu He Glu His Asp Met Asn Leu Val Met Glu Val Thr Glu Arg 
65 70 75 80 

He Tyr Val Leu Glu Tyr Gly Arg Leu lie Ala Gin Gly Thr Pro Asp 

85 90 95 

Glu lie Lys Thr Asn Lys Arg Val lie Glu Ala Tyr Leu Gly Gly Glu 
100 105 no 

Ala 



(2) INFORMATION FOR SEQ ID NO: 272: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 82 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 272: 

Met Thr Arg Tyr Ala Leu Leu Val Arg Gly lie Asn Val Gly Gly Lys 
1 5 10 15 
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Asn Lys Val Val Met Ala Glu Leu Arg Gin Glu Leu Thr Asn Leu Gly 
^ 25 3q 

Leu Glu Lys Val Glu Ser Tyr lie Asn Ser Gly Asn lie Pne Phe Thr 
35 40 



45 



Ser lie Asp Ser Lys Ala Gin Leu Val Glu Lys Leu Glu 



50 55 
Ala Val His Tyr Pro Phe He Pro Glu Leu 
65 70 
Gly Leu 



Thr Phe Phe 



60 



Phe Phe Thr Glu Ser Arg 



75 



80 



(2) INFORMATION FOR SEQ ID NO: 273: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 110 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

<ii> MOLECULE TYPE: Protein 
<xi) SEQUENCE DESCRIPTION : SEQ ID NO:273: 
Met Glu Gin Phe Leu Asp Asn lie Lys Asp Leu Glu Val Thr Thr Val 

Val Arg Ala Gin Glu Ala Leu Asp Lys Jy S 01u Thr Ala Tnr ^ phe 

20 25 
He Gly Arg Lys Thr Cys Pro Tyr Cys Arg Lys Phe Ala Gly Thr Leu 

35 40 45 

Ser Gly Val Val Ala Glu Thr Lys Ala His l le Tyr Phe lie Asn Ser 

, 50 5 5 60 

Glu Glu Pro Ser Gin Leu Asn Asp Leu Gin Ala Phe Arg Ser Arg Tyr 

70 75 80 

Gly He Pro Thr Val Pro Gly Phe Val w4 e r^ ^ 

u *iy me val His He Thr Asp Gly Thr Asn 

85 90 

Gin Arg Pro Leu Ala Asp Ser Ser Met Ser Ala Gin Glu lie 

100 105 

105 110 

(2) INFORMATION FOR SEQ ID NO: 274: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 191 amino acids 

(B) TYPE: amino acid 

270 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 274: 

Met Tyr Leu lie Leu Pro Phe Cys Val Pro Gly Ser Phe Asp Arg Lys 

1 5 io 15 

Val Leu Val Gly Arg Pro Asp Val Lys Gly Arg Glu Ala lie Leu Lys 

20 25 30 

Val His Ala Lys Asn Lys Pro Leu Ala Glu Asp Val Asp Leu Lys Leu 

35 40 45 

Val Ala Gin Gin Thr Pro Gly Phe Val Gly Ala Asp Leu Glu Asn Val 

50 55 60 

Leu Asn Glu Ala Ala Leu Val Ala Ala Arg Arg Asn Lys Ser He He 
65 70 75 80 

Asp Ala Ser Asp He Asp Glu Ala Glu Asp Arg Val He Ala Gly Pro 

85 go 95 

Ser Lys Lys Asp Lys Thr Val Ser Gin Lys Glu Arg Glu Leu Val Ala 

100 105 no 

Tyr His Glu Ala Gly His Thr He Val Gly Leu Val Leu Ser Asn Ala 

H5 120 i 25 

Arg Val Val His Lys Val Thr He Val Pro Arg Gly Arg Ala Gly Gly 

"0 135 140 

Tyr Xaa He Ala Leu Pro Lys Glu Asp Gin Met Leu Leu Ser Lys Glu 
145 150 155 160 

Asp Met Lys Glu Gin Leu Ala Gly Leu Met Gly Gly Arg Val Ala Glu 

165 170 175 

Lys He He Ser Asn Val Gin Thr Thr Arg Ser Phe Lys Arg Leu 
18° 185 190 

(2) INFORMATION FOR SEQ ID NO: 275: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 84 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 275: 
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Met Lys Gin Val Phe Leu Ser Thr Thr Thr Glu Phe Lys Glu He Asp 

1 5 10 is 

Thr Leu Glu Pro Gly Thr Trp He Asn Leu Val Asn Pro Thr Gin Asn 

20 25 30 

Glu Ser Leu Glu He Ala Asn Thr Phe Asp He Asp He Ala Asp Leu 

35 40 45 

Arg Ala Pro Leu Asp Ala Glu Glu Met Cys Arg He Thr He Glu Asp 

50 55 60 

Glu Tyr Thr Leu He He Val Asp Val Pro Val Thr Glu Glu Arg Asn 
65 70 75 

Asn Arg Thr Tyr 



80 



(2) INFORMATION FOR SEQ ID NO: 276: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 162 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 276; 

Met Ala Gin Glu Ala Leu Gin Lys Val Ala Ala Glu Met Gly Val Gly 

15 10 15 

He Lys Val Glu Thr Asn Gly Ala Ser Gly Val Gly Asn Gin Leu Thr 

20 25 30 

Ala Glu Asp He Arg Lys Ala Lys Ala He He He Ala Ala Asp Lys 

35 40 45 

Ala Val Glu Met Asp Arg Phe Asp Gly Lys Pro Leu He Asn Arg Pro 

50 55 6 o 

Val Ala Asp Gly He Arg Lys Thr Glu Glu Leu He Asn Leu Ala He 
65 70 75 80 

Ser Gly Asp Ala Glu Val Tyr Arg Ala Ala Asn Gly Ala Lys Val Ala 

85 90 95 

Thr Ala Ser Asn Glu Lys Gin Ser Leu Gly Gly Ala Phe Tyr Lys His 

100 105 no 

Leu Met Ser Gly Val Ser Gin Met Leu Pro Phe Val He Gly Gly Gly 
H5 120 125 
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lie Met lie Ala Leu Ala Phe Leu He Asp Gly Ala Leu Ser Val Pro 

130 135 140 

Asn Glu Asn Leu Gly Asn Leu Gly Ser Tyr His Glu Phe Ser Phe Tyr 

145 "0 155 leo 

Val Pro 



(2) INFORMATION FOR SEQ ID NO: 277: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 97 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 277: 

Met Asp He Trp Glu Lys Met Tyr Glu Glu Ala Gin Lys Leu Tyr Asn 

Pro His Glu Val Ser Asp Phe Val Tyr Ala Asn His Val Val Ala Ala 

20 25 30 

Val Glu Ala Gly Asp Gly Gin lie Phe Thr Gly Phe Cys Met Glu Gly 

35 40 45 

Thr Cys Gly Val Phe His Leu Cys Ala Glu Arg Ala Ala Leu Phe Asn 

50 55 60 

Met Tyr Gin Phe Ser Gly Gin Thr Lys Val Xaa Glu Val Leu Ala Phe 
65 70 75 80 

Arg Asp Lys Pro Pro Tyr Gly Gly Ser Ser Ala Met Pro Leu Arg Cys 
85 90 95 

Leu 



(2) INFORMATION FOR SEQ ID NO: 278: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 146 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 278: 

Leu Tyr Leu val Ala Thr Pro He Gly Asn Leu Asp Asp Met Thr Phe 

15 10 15 

Arg Ala lie Gin Thr Leu Lys Glu Val Asp Trp lie Ala Ala Glu Asp 

20 25 30 

Thr Arg Asn Thr Gly Leu Leu Leu Lys His Phe Asp He Ser Thr Lys 

35 40 45 

Gin lie Ser Phe His Glu His Asn Ala Lys Glu Lys Thr Pro Asp Leu 

50 5S 60 

He Gly Phe Leu Lys Ala Gly Gin Ser lie Ala Gin Val Ser Asp Ala 
" 70 75 80 

Gly Leu Pro Ser He Ser Asp Pro Gly His Gly Phe Gly Leu Arg Gin 



85 



90 



95 



Cys 



Leu Leu Glu Gly Arg Asn Cys Ser Cys Tyr Ser Ser Arg Cys Leu 

100 105 110 

Arg Asn Phe Cys Leu Asp Cys Gin Trp Phe Ser Ala Thr Ala Thr Tyr 



H5 120 



125 



Leu Leu Arg Phe Phe Thr Glu Lys lie Arg Ser Thr Glu Ala lie Phe 



130 135 140 



Trp Leu 
145 

(2) INFORMATION FOR SEQ ID NO: 279 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 74 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:279: 

Met Pro Asp Asn Leu Ala Leu Arg Met Arg Pro Lys Thr lie Asp Gin 

5 10 15 

Val He Gly Gin Glu Xaa Leu Val Gly Pro Gly Lys lie lie Arg Arg 

20 25 30 

Met Val Glu Ala Asn Arg Leu Ser Ser Met lie Leu Tyr Gly Pro Pro 
35 40 45 
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Gly lie Gly Lys Thr Ser He Ala Ser Ala lie Ala ciy Thr Thr Lys 



Tyr Ala Leu Arg Asp Pro Gin Arg Asp Ser 



60 



« 70 



(2) INFORMATION FOR SEQ ID NO: 280: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 280 : 
Met Gin Asp Glu Val Glu Val Gl y Asp Val phe Lys ^ ^ ^ ^ 

Ala Leu Asn Glu Glu Gly Val Phe He Phe Asp Val His Ser 11 Tyr 
2 0 



25 



Gin Thr Asp Glu Val Leu Pro Trp Leu Phe Leu Pro 



30 



35 40 



(2) INFORMATION FOR SEQ ID NO: 281: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 79 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:281: 

Leu Ser Leu He Asp Asn Asp Pro Phe a 
^ p «ro Phe Lys Asp Asp Glu Asp Phe Val 

10 1c; 
ser Thr ^ ^ Leu „ Lys ^ ^ ^ = ^ ^ ^ ^ ^ 

v.i s« oxu ^ ^ eiy s . r oln ^ fto ^ Lyj ^ £ ^ 



35 40 

«" 45 
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lie Pro Lys Trp Ala Asp Lys Leu Gly Arg Glu Met Gly Leu Asn Phe 
50 55 60 

Ser Gin Thr Leu Thr Asp Ala He Ala Asp Lys Lys Val Gin Ala 
65 70 75 



(2) INFORMATION FOR SEQ ID NO: 2 82: 

(i) SEQUENCE CHARACTERISTICS : . 

(A) LENGTH: 220 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 2 82: 



Leu Thr Lys Ser Met Thr Pro Asp Arg Glu Val He Thr Phe He Pro 

1 5 10 15 

Glu Lys Phe He Val Asp Gly Phe Gin Gly He Arg Asp Pro Arg Gly 

20 25 30 

Met Met Gly Val Arg Leu Glu Met Arg Gly Leu Leu Tyr Thr Gly Pro 

35 40 45 

Arg Thr He Leu His Asn Leu Arg Lys Thr Val Glu Arg Ala Gly Val 

50 . 55 eo 

Gin Val Glu Asn Val He He Ser Pro Leu Ala Met Val Gin Ser Val 
65 70 75 80 

Leu Asn Glu Gly Glu Arg Glu Phe Gly Ala Thr Val He Asp Met Gly 

85 90 95 

Ala Gly Gin Thr Thr Val Ala Thr He Arg Asn Gin Glu Leu Gin Phe 

100 105 no 

Thr His He Leu Gin Glu Gly Gly Asp Tyr Val Thr Lys Asp He Ser 

US 120 125 

Lys Val Leu Lys Thr Ser Arg Lys Leu Ala Glu Gly Leu Lys Leu Asn 

130 135 140 

Tyr Gly Glu Ala Tyr Pro Pro Leu Ala Ser Lys Glu Thr Phe Gin Val 
145 150 155 160 

Lys Val He Gly Glu Val Glu Pro Val Lys Val Thr Glu Ala Tyr Leu 

165 170 175 

Ser Glu He He Ser Ala Arg He Lys His He Leu Glu Gin He Lys 

180 185 190 

Gin Glu Leu Asp Arg Arg Arg Leu Val Gly Pro Pro Trp Trp Tyr Cys 
1^5 200 205 
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Leu Asn Arg Trp Glu Cys His Phe Thr Arg Tyr Gly 
210 215 220 

(2) INFORMATION FOR SEQ ID NO:283: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 96 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
(D> TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 283 : 

Met Met Pro Glu Trp Arg Leu Asp Leu He Lys Glu Tyr Tyr Val Pro 

Tyr Met Ser Asn Asp Asn Asn Tyr Pro Arg Val Phe Met Thr Gin Glu 

20 25 30 

Asp Leu Asp Lys lie Ala His lie Glu Ala Asp Met Asn Asp Tyr lie 

35 40 45 

Tyr Arg Lys Arg Ala Glu Trp lie Val Asn Gly Asn lie Asp Thr Glu 

50 55 go 

Trp Asp Asp Tyr Lys Lys Glu Leu Glu Lys Tyr Gly Leu Ser Asp Tyr 
65 70 75 so 

Leu Ala lie Lys Gin Lys Tyr Tyr Asp Gin Tyr Gin Ala Asn Lys Asn 
85 9 0 95 

(2) INFORMATION FOR SEQ ID NO: 284: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 161 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 284: 

Met Arg He Ala lie Gly Cys Asp His Phe Val Thr Asp Glu Lys Met 
1 5 10 15 
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Ala Val Ser 

Gly Thr Tyr 
35 

Val Arg Glu 
50 

Cys Gly Thr 
65 

Val Arg Ser 

Glu Gin Leu 

Glu Leu Leu 
115 

Lys Pro Ser 

130 
Glu Thr His 
145 
Pro 



Glu Phe Leu Lys 
20 

Asp His Thr Arg 

Ala Val Thr He 
55 

Gly Val Gly He 
70 

Ala Leu Val Arg 
85 

Asn Ala Asn Val 
100 

Met Cys Asp He 

Glu Glu Asn Lys 
135 

Asn Ala Gin Gin 
150 



Ser Lys Gly Tyr Glu Val He Asp Phe 

25 30 
Thr His Tyr Pro He Phe Gly Lys Lys 
40 45 

Gly Gin Pro Asp Leu Gly Val Cys Thr 
60 

Asn Asn Pro Val Asn Lys Val Pro Arg 

75 80 
Asp Met Thr Thr Ala Leu Tyr Ala Lys 

90 95 
He Gly Phe Gly Gly Lys He Thr Gly 

He Glu Ala Phe He His Ala Glu Tyr 
120 125 

Lys Leu He Ala Lys He Glu His Val 
140 

Thr Asp Ala Asn Phe Phe Thr Glu Xaa 
155 160 



(2) INFORMATION FOR SEQ ID NO: 285 : 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 77 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii> MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 285: 

Leu Lys Lys Gin Val Leu Thr Met Xaa He Asp Arg Arg Pro Phe Gly 

15 10 „ 

Gly Ala Asp He Asp Ala Ala Gly Pro Pro Leu Pro Asp Glu Thr Leu 

20 25 30 

Lys Ala Ser Arg Glu Ala Asp Ala He Leu Leu Val Ala He Gly Ser 

35 40 45 

Pro Gin Tyr Asp Gly Val Ala Val Arg Pro Glu Gin Gly Leu Met Ala 

50 55 60 

Leu Arg Lys Asn Ser He Phe Thr Leu He Phe Val Leu 
65 70 75 
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(2) INFORMATION FOR SEQ ID NO: 2 86: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 59 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 



(ii) MOLECULE TYPE: Protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 286 : 

Met Ala Phe He Glu Lys Gly Gin Glu lie Asp Met Glu Val He Lys 

15 10 15 

Ala Glu Thr Gin Leu Ser Ala Glu Ala Leu Arg Leu Lys Glu Ser Arg 

20 25 3 0 

Asp Arg Glu Leu Ala Asp lie lie Ser Gly Glu Asp Asp Arg lie Leu 

35 40 45 

Leu Val He Gly Pro Cys Ser Ser Asp Asn Asp 
50 55 



(2) INFORMATION FOR SEQ ID NO: 2 87: 

<i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 56 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 287: 

Met Val Met Arg Val Tyr Thr Ala Lys Pro Arg Thr Asn Gly Asp Gly 

1 5 10 15 

TVr Lys Gly Leu Val His Gin Pro Asp Thr Ser Lys Ala Pro Thr Leu 

20 25 30 

lie Asn Gly Leu Gin Ala Val Arg His Leu His Tyr Arg Phe Asp Tyr 

35 40 45 

Arg Tyr Trp Phe Asp Asn Gly Arg 
50 55 
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(2) INFORMATION FOR SEQ ID NO: 288: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 63 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 288: 

Met Asn Lys Arg Val Lys He Val Ala Thr Leu Gly Pro Ala Val Glu 

1 5 10 15 

He Arg Gly Gly Lys Lys Phe Gly Glu Asp Gly Tyr Trp Gly Glu Lys 

20 25 30 

Leu Asp Val Glu Ala Ser Ala Lys Asn He Ala Lys Leu He Glu Ala 

35 40 45 

Gly Ala Asn Thr Phe Arg Phe Asn Phe Ser His Gly Asp His Gin 
50 55 60 

(2) INFORMATION FOR SEQ ID NO: 289: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 289 : 

Met Gly Asp Lys Pro He Ser Phe Arg Asp Ala Asp Gly Asn Phe Val 

1 5 io 15 

Ser Ala Ala Asp Val Trp Asn Glu Lys Lys Leu Glu Glu Leu Phe Asn 

20 25 30 

Arg Leu Asn Pro Asn Arg Ala Leu Arg Leu Ala Arg Thr Lys Lys Glu 

35 40 45 

Asn Pro Ser Gin 
50 

(2) INFORMATION FOR SEQ ID NO: 290: 
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He lie Val He Thr Pro Gin Ser Thr Gly Pro Gin Ser lie Leu Phe 

500 505 510 

Trp He Asp Ala Val Gin Asn Tyr Val Leu Phe Asn Gin Leu Ser Asp 

515 520 525 

Ala Gin Glu Leu lie Gin Arg Gin Gly He Glu Asn Trp Val Ser Glu 

530 535 540 

Met Gin Thr Gly Tyr His Asn Tyr lie Thr Leu Leu Asp Asn lie Gin 
545 550 

555 560 



Arg Glu Arg Trp Val Met Leu Ala Gly Ala Val Leu Gly He Ala Thr 

565 570 575 

Ser He Leu Leu Phe Asn Thr Met Asn Arg Leu Tyr Phe Glu Glu Phe 

580 585 590 

Arg Arg Ala He Phe He Lys Arg He Ala Gly Leu Arg Phe Leu Glu 

595 "0 605 

He His Arg Thr Tyr Leu Phe Ala Gin Leu Gly Val Phe Leu Leu Gly 



610 "5 620 



Phe Val Ala Ser Val Phe Leu Gin Val Glu He Gly Val Ala Phe Leu 



625 630 



"5 640 
Val Leu Leu Leu Phe Thr Gly Leu Ser Leu Leu Gin Leu His Val Gin 

645 «0 655 

Met Gin Lys Glu Asn Lys Met Ser He Leu Val Leu Lys Gly Gly 
660 «65 670 

(2) INFORMATION FOR SEQ ID NO: 302: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 77 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 302: 

Met Lys Glu Lys Glu He Phe Asp Ser He Val Thr He He Gin Glu 

Arg Gin Gly Glu Asp Phe Val Val Thr Glu Ser Leu Ser Leu Us Asp 

^ 25 20 

Asp Leu Asp Ala Asp Ser Val Asp Leu Met Glu Phe He Zu Thr Leu 

35 40 « 

Glu Asp Glu Phe Ser He Glu He Ser Asp Glu Glu He Asp Gin Leu 

50 55 60 
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Gin Ser Val Gly Asp Val Val Lys He He Gin Gly Lys 
65 70 75 

(2) INFORMATION FOR SEQ ID NO: 303: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 96 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 303: 

Met Arg Ser Asp Phe Leu Arg Asn Lys He Trp Tyr Ser Ser Ser Met 

15 10 15 

Asn Asp Glu Ala Ser Lys Gin Leu Thr Asp Ala Arg Phe Lys Arg Leu 

20 25 30 

Val Gly Val Gin Arg Thr Thr Phe Glu Glu Met Leu Ala Val Leu Lys 

35 40 45 

Thr Ala Tyr Gin Leu Lys His Ala Lys Gly Gly Arg Lys Pro Lys Leu 

50 55 €0 

Ser Leu Glu Asp Leu Leu Met Ala Thr Leu Gin Tyr Val Arg Glu Tyr 
65 7 ° 75 80 

Arg Thr Tyr Glu Glu He Ala Ala Asp Phe Gly He His Glu Ser Thr 
85 90 95 

(2) INFORMATION FOR SEQ ID NO: 304 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 104 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:304: 

Met Lys Leu Phe Lys Met Ser Cys Arg Asn He Gly Gin Ala Gly Lys 
15 10 15 
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lie Leu Ala Asp Ser Asp Tyr Gin Gly Leu Met Lys He Tyr Pro Gin 

20 25 30 

Ala Gin Thr Pro Arg Lys Ser Ser Lys Leu Lys Pro Leu Thr Ala Glu 

35 40 45 

Asp Lys Ala Tyr Asn His Ala Leu Ser Lys Glu Arg Ser Lys Val Glu 

50 55 60 

Asn He Phe Ala Lys Val Lys Thr Phe Lys Met Phe Ser Thr Thr Tyr 
65 70 75 so 

Arg Asn His Arg Lys Arg Phe Gly Leu Arg Met Asn Leu He Ala Gly 

85 go 95 

He He Asn His Glu Leu Gly Phe 
100 



(2) INFORMATION FOR SEQ ID NO: 305: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 79 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:305: 

Leu Arg Lys Val Val Val Val Arg Leu Glu Lys Pro Val Thr Phe His 

1 5 10 15 

Asn Met He Ala Pro Asp Lys Glu Val Glu Val Ser Leu Leu Phe Phe 

20 25 30 

He He Asn Asn Ser Ser Ser Ser Gin Thr Asn He Leu Ala Gin Leu 

35 40 45 

Met Asp Phe Phe Thr Gly Asn Gly His Leu Glu Asp Leu Ser Lys He 

50 55 60 

Ser Glu Pro Glu Lys Leu Tyr Ala Tyr He Ala Glu Ala Thr Ala 
65 70 75 

(2) INFORMATION FOR SEQ ID NO:306: 



<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 100 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE : Protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 306: 

Met He Lys He Leu Ala Ala Cys Gly Ala Gly Val Asn Ser Ser His 

15 10 15 

Gin He Lys Ser Ala Leu Glu Glu Glu Leu Ser Asn Arg Gly Tyr Asp 

20 25 30 

Val His Cys Asp Ala Val Met Val Lys Asp Val Asn Glu Asp Leu Met 

35 40 45 

Lys Gly Tyr Asp He Phe Thr Pro He Ala Ala Thr Asp Leu Gly Phe 

50 55 60 

Glu Pro Gly He Pro Val He Glu Ala Gly Pro He Leu Phe Arg He 
65 70 75 so 

Pro Ala Met Ser Ala Pro Val Phe Asp Asn He Arg Leu Pro Ala Lys 
85 90 95 

Gin Asn Met Val 
100 

(2) INFORMATION FOR SEQ ID NO: 3 07: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 62 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 307: 

Met Val Val Ser Phe He Leu Leu Met Ala Thr He Val Val Glu Pro 

1 5 10 15 

Leu Glu Gly He Phe His Val Thr Lys Leu Asp Leu Ser Gin Trp Gly 

20 25 30 

He Val Met Ala Gly Ser Phe Ser Met He He He Val Glu He Ala 

35 40 45 

Lys Phe He Gin Arg Lys Leu Gly Phe Asp Lys Asn Ala He 
50 55 60 

(2) INFORMATION FOR SEQ ID NO: 308: 
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(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 158 amino acids 
<B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 308: 

Met Asn His Lys Pro Arg Gly Arg Lys Ala Ser Phe Phe Ser Gly Gly 

15 10 15 

Val Leu Ser Ser lie lie Tyr Gin Gly Val Leu Gin Ala Ala Leu Val 

20 25 30 

Met Ser Val Tyr Gly Leu Ala lie Ala Tyr Pro Val His Val Gly Asp 

35 40 45 

Asn His Ala lie His Ala Asp Ala Leu Thr Met Ala Phe Ala Thr Leu 

50 55 60 

Gly Leu He Gin Leu Phe His Ala Tyr Asn Val Lys Ser Val Tyr Gin 
65 70 75 bo 

Ser lie Leu Thr Val Gly Pro Phe Lys Ser Lys Thr Phe Asn Trp Ser 

85 90 95 

He Leu Val Ser Phe He Leu Leu Met Ala Thr lie Val Val Glu Pro 

100 105 110 

Leu Glu Gly He Phe His Val Thr Lys Leu Asp Leu Ser Gin Trp Gly 

115 120 12 5 

He Val Met Ala Gly Ser Phe Ser Met He He He Val Glu He Val 

130 135 140 

Lys Phe lie Gin Arg Lys Leu Gly Phe Asp Lys Asn Ala He 

155 



145 150 



(2) INFORMATION FOR SEQ ID NO: 309; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 249 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 309: 
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Met Ala His Gin Ala Leu Arg Val Leu Ala Gly Ala Tyr Lys He He 

15 10 15 

Asp Ser He Pro Glu Asn Leu Thr Ser Glu Glu Leu Glu Asn Asp Leu 

20 25 30 

He Phe Thr Gly Leu He Gly Met lie Asp Pro Glu Arg Pro Glu Ala 

35 40 45 

Ala Glu Ala Val Arg Val Ala Lys Glu Ala Gly He Arg Pro He Met 

50 55 60 

He Thr Gly Asp His Gin Asp Thr Ala Glu Ala He Ala Lys Arg Leu 
65 70 75 80 

Gly He He Asp Ala Asn Asp Thr Glu Gly His Val Leu Thr Gly Ala 

85 90 95 

Glu Leu Asn Glu Leu Ser Asp Glu Glu Phe Glu Lys Val Val Gly Gin 

100 105 HO 

Tyr Ser Val Tyr Ala Arg Val Ser Pro Glu His Lys Val Arg He Val 

115 120 125 

Lys Ala Trp Gin Lys Gin Gly Lys Val Val Ala Met Thr Gly Asp Gly 

130 135 140 

Val Asn Asp Ala Pro Ala Leu Lys Thr Ala Asp He Gly He Gly Met 
145 150 155 160 

Gly He Thr Gly Thr Glu Val Ser Lys Gly Ala Ser Asp Met He Leu 

165 170 175 

Ala Asp Asp Asn Phe Ala Thr He He Val Ala Val Glu Glu Gly Arg 

180 185 190 

Lys Val Phe Ser Asn He Gin Lys Thr He Gin Tyr Leu Leu Ser Ala 

195 200 205 

Asn Thr Ala Glu Val Leu Thr He Phe Leu Ser Thr Leu Phe Gly Trp 

210 215 220 

Asp Val Leu Gin Pro Val His Leu Leu Trp He Asn Leu Val Thr Asp 
225 230 235 240 

Thr Phe Pro Ala He Val Leu Trp Cys 



(2) INFORMATION FOR SEQ ID NO: 310: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 498 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 



245 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:310: 

Met Lys Gin Ser Lys Met Pro He Pro Thr Leu Arg Glu Met Pro Ser 

1 5 10 15 

Asp Ala Gin Val He Ser His Ala Leu Met Leu Arg Ala Gly Tyr Val 

20 25 30 

Arg Gin Val Ser Ala Gly Val Tyr Ser Tyr Leu Pro Leu Ala Asn Arg 

35 40 45 

Val lie Glu Lys Ala Lys Asn He Met Arg Gin Glu Phe Glu Lys He 

50 55 60 

Gly Ala Val Glu Met Leu Ala Pro Ala Leu Leu Ser Ala Glu Leu Trp 
65 70 75 bo 

Arg Glu Ser Gly Arg Tyr Glu Thr Tyr Gly Glu Asp Leu Tyr Lys Leu 

85 90 95 

Lys Asn Arg Glu Lys Ser Asp Phe He Leu Gly Pro Thr His Glu Glu 

100 io 5 11(J 

Thr Phe Thr Ala He Val Arg Asp Ser Val Lys Ser Tyr Lys Gin Leu 

115 120 125 

Pro Leu Asn Leu Tyr Gin He Gin Pro Lys Tyr Arg Asp Glu Lys Arg 

130 "5 140 

Pro Arg Asn Gly Leu Leu Arg Thr Arg Glu Phe He Met Lys Asp Ala 
145 "0 155 160 

Tyr ser Phe His Ala Asn Tyr Asp Ser Leu Asp Ser Val Tyr Asp Glu 

165 170 175 

Tyr Lys Ala Ala Tyr Glu Arg He Phe Thr Arg Ser Gly Leu Asp Phe 

180 185 190 

Lys Ala He He Gly Asp Gly Gly Ala Met Gly Gly Lys Asp Ser Gin 

195 200 205 

Glu Phe Met Ala He Thr Ser Ala Arg Thr Asp Leu Asp Arg Trp Val 

210 215 220 

Val Leu Asp Lys Ser Val Ala Ser Phe Asp Glu He Pro Ala Glu Val 
225 230 235 240 

Gin Glu Glu lie Lys Ala Glu Leu Leu Lys Trp He Val Ser Gly Glu 

245 250 255 

Asp Thr lie Ala Tyr Ser Ser Glu Ser Ser Tyr Ala Ala Asn Leu Glu 

260 265 270 

Met Ala Thr Asn Glu Tyr Lys Pro Ser Asn Arg Val Val Ala Glu Glu 

275 280 285 

Glu Val Thr Arg Val Glu Thr Pro Asp Val Lys Ser He Asp Glu Val 

290 295 300 

Ala Ala Phe Leu Asn Val Pro Glu Glu Gin Thr He Lys Thr Leu Phe 

305 310 -jic 

AW 315 320 
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Tyr lie Ala Asp Gly Glu Leu Val Ala Ala Leu Leu Val Gly Asn Asp 
325 330 335 

Gin Leu Asn Glu Val Lys Leu Lys Asn His Leu Gly Ala Asn Phe Phe 

340 345 350 

Asp Val Ala Ser Glu Glu Glu Val Ala Asn Val Val Gin Ala Gly Phe 

355 360 365 

Gly Ser Leu Gly Pro Val Gly Leu Pro Glu Asn He Lys lie He Ala 

370 375 380 

Asp Arg Lys Val Gin Asp Val Arg Asn Ala Val Val Gly Ala Asn Glu 
385 390 395 400 

Asp Gly Tyr His Leu Thr Gly Val Asn Pro Gly Arg Asp Phe Thr Ala 

405 410 415 

Glu Tyr Val Asp He Arg Glu Val Arg Glu Gly Glu He Ser Pro Asp 

420 425 430 

Gly Gin Gly Val Leu Asn Phe Ala Arg Gly He Glu He Gly His He 

435 440 445 

Phe Lys Leu Gly Thr Arg Tyr Ser Ala Ser Met Gly Ala Asp Val Leu 

450 455 460 

Asp Glu Asn Gly Arg Ala Val Pro He He Met Gly Cys Tyr Gly He 
465 470 475 480 

Gly Val Ser Arg Leu Leu Ser Ala Val Met Glu Gin His Ala Arg Leu 
485 490 495 

Phe Val 



(2) INFORMATION FOR SEQ ID NO: 311:. 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 44 amino acids 
<B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 311: 
Met Gly lie lie Arg He Gly Ala Glu Val Lys Glu Gly Asp lie Leu 



Val Gly Lys Val Thr Pro Lys Gly Glu Lys Asp Leu Ser Ala Gly Arg 



1 



5 



10 



15 



20 25 
Thr Ser Leu Ala Arg Tyr Leu Trp Arg Gin Val Ser 
35 40 
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(2) INFORMATION FOR SEQ ID NO: 312: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 66 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 312: 

Met Arg Val Leu Asp Glu Asp Asp Gin Glu Val Glu Leu Arg Asp Leu 

1 5 10 15 

Asp Glu Gly Met Asp Glu Asp Val He His Val Asp Asp Leu Glu Lys 

20 2 5 30 

Ala Arg Glu Lys Ala Ala Gin Glu Ala Lys Ala Ala Phe Glu Ala Glu 

35 40 45 

Glu Ala Glu Lys Ala Thr Lys Ala Glu Ala Thr Glu Glu Ala Ala Glu 

50 55 60 

Gin Glu 
65 

(2) INFORMATION FOR SEQ ID NO: 313: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 364 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 313 : 

Met Leu val Asn Val Val Pro Leu Pro Ser Gin Pro Thr Trp Arg Val 

15 10 15 

Val Val Pro Thr Ser Ser Leu Val Lys Val Phe Val Asn Leu Glu Asp 

20 25 30 

Phe Val Leu Leu Val Pro Glu Arg His Glu Ser Arg Arg lie Asp Asn 
35 40 45 
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Gin Leu Arg Gly Arg Ser Gly Arg Gin Gly Asp Pro Gly Glu Ser Gin 
50 55 60 

Phe Tyr Leu Ser Leu Glu Asp Asp Leu Met Lys Arg Phe Gly Ser Glu 
65 70 75 80 

Arg Leu Lys Gly lie Phe Glu Arg Leu Asn Met Ser Glu Glu Ala He 

85 90 95 

Glu Ser Arg Met Leu Thr Arg Gin Val Glu Ala Ala Gin Lys Arg Val 

100 105 110 

Glu Gly Asn Asn Tyr Asp Thr Arg Lys Gin Val Leu Gin Tyr Asp Asp 

115 120 125 

Val Met Arg Glu Gin Arg Glu lie lie Tyr Ala Gin Arg Tyr Asp Val 
130 i3 5 140 

He Thr Ala Asp Arg Asp Leu Ala Pro Glu He Gin Ser Met He Lys 
145 150 155 160 

Arg Thr He Glu Arg Val Val Asp Gly His Ala Arg Ala Lys Gin Asp 

165 170 175 

Glu Lys Leu Glu Ala He Leu Asn Phe Ala Lys Tyr Asn Leu Leu Pro 

180 185 190 

Glu Asp Ser He Thr Met Glu Asp Leu Ser Gly Leu Ser Asp Lys Ala 

185 200 205 

He Lys Glu Glu Leu Phe Gin Arg Ala Leu Lys Val Tyr Asp Ser Gin 

210 215 220 

Val Ser Lys Leu Arg Asp Glu Glu Ala Val Lys Glu Phe Gin Lys Val 
225 230 235 240 

Leu He Leu Arg Val Val Asp Asn Lys Trp Thr Asp His He Asp Ala 

245 250 255 

Leu Asp Gin Leu Arg Asn Ala Val Gly Leu Arg Gly Tyr Ala Gin Asn 

260 265 270 

Asn Pro Val Val Glu Tyr Gin Ala Glu Gly Phe Arg Met Phe Asn Asp 

275 280 285 

Met He Gly Ser He Glu Phe Asp Val Thr Arg Leu Met Met Lys Ala 

290 295 3 00 

Gin He His Glu Gin Glu Arg Pro Gin Ala Glu Arg His He Ser Thr 

305 310 „. 

JAW 315 320 

Thr Ala Thr Arg Asn He Ala Ala His Gin Ala Ser Met Leu Glu Asp 

325 330 335 

Leu Asp Leu Ser Gin He Gly Arg Asn Glu Leu Cys Pro Cys Gly Ser 

340 345 3 5 o 

Gly Lys Lys Phe Lys Asn Cys His Gly Lys Arg Gin 
355 360 



(2) INFORMATION FOR SEQ ID NO: 3 14: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 128 amino acids 

(B) TYPE: amino acid 

tC) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 314: 

Met Ala Arg Gin Tyr Lys Val Asp Gly He Pro Ala Gly He Val Ser 

1 5 10 15 

Leu Ser Ser Phe lie Thr Val Thr Pro Phe lie Thr Gly Glu Ala Gly 

20 25 30 

Ala Gly Met Pro Thr Ala Phe Met Ala Ser Lys Gly Leu Phe Val Ala 

35 40 45 

Met He Leu Gly Leu He Asn Gly Tyr He Tyr Gin Trp Phe lie Asn 

50 55 60 

His Asn He Gin He Lys Met Pro Asp Gly Val Pro Pro Ala Val Ser 
65 70 75 so 

Lys Ser Phe Ser Ala He He Pro Gly Ala Val Thr He Val Gly Trp 

B5 90 95 

Leu He Val Tyr Ala Thr Leu Asp Lys Leu Ser Leu Pro Asn Leu His 

100 105 110 

Glu He Ala Gin Val Ser Phe Gly Arg Ser Thr Trp Thr Phe Arg Lys 
115 120 125 

(2) INFORMATION FOR SEQ ID NO: 315: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 203 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Protein 

(Xi) SEQUENCE DESCRIPTION : SEQ ID NO: 315: 

Met Leu Glu Leu Arg Asn He Asn Lys Val Phe Gly Asp Lys Gin He 

1 5 10 15 

Leu Ser Asp Phe Ser Leu Ser He Pro Glu Lys Gin He Leu Ala He 
20 25 30 
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Val Gly 

Gly Leu 

50 
Leu Glu 
65 

Gin Asp 

Leu Ser 

Lys Ala 

Ser Tyr 
130 
Ala Arg 
145 

Thr Ser 
Leu Gin 
Leu Gin 



Pro Ser Gly Gly Ser Lys Thr Thr Leu Leu Arg Met Leu Ala 

35 40 45 

Glu Thr He Asp Ser Gly Gin He Phe Tyr Asn Gly Gin Pro 

55 60 
Leu Asp Glu Leu Gin Lys Arg Asn Leu Leu Gly Phe Val Phe 

70 75 80 

Phe Gin Leu Phe Pro His Leu Ser Val Leu Glu Asn Leu Thr 

85 90 95 

Pro Val Lys Thr Met Gly Met Lys Gin Glu Glu Ala Glu Lys 

100 105 110 

Ser Gly Leu Leu Glu Gin Leu Gly Leu Gly Gly His Ala Glu 
H5 120 125 

Pro Phe Ser Leu Ser Gly Gly Gin Lys Gin Arg Val Ala Leu 

135 140 
Ala Met Met He Asp Pro Glu He lie Gly Tyr Asp Glu Pro 
150 155 160 

Ala Leu Asp Pro Glu Leu Arg Leu Glu Val Glu Lys Leu lie 

165 170 175 

Asn Arg Glu Leu Gly Met Thr Gin lie Val Val Thr His Asp 

180 185 190 

Phe Ala Glu Asn He Ala Met Tyr Tyr 
195 200 



(2) INFORMATION FOR SEQ ID NO: 316: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 57 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 316: 

Met Thr Ala Leu Phe Leu Val Ala. Cyz Gly Lys Asn Ser Ser Glu Thr 

1 5 io 15 

Ser Gly Asp Asn Trp Ser Lys Tyr Gin Ser Asn ?.ys Ser He Thr He 

20 25 30 

Gly Phe Asp Ser Thr Phe Ser Ser Gin Trp Asp Leu Leu Arg Lys Met 

35 40 45 

Val Leu Met Gin Asp Leu He Leu He 
50 55 
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(2) INFORMATION FOR SEQ ID NO : 317: 

(i) SEQUENCE CHARACTERISTICS: 1 

(A) LENGTH: 181 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 317: 

Met Thr Gin Ala lie Leu Glu lie Lys His Leu Lys Lys Ser Tyr Gly 

Gin Asn Glu Val Leu Lys Asp lie Ser Leu Thr Val His Lys Gly Glu 

20 25 30 

Val He ser lie lie Gly Ser Ser Gly Ser Gly Lys Ser Thr Phe Leu 

" 40 « 

Arg ser lie Asn Leu Leu Glu Thr Pro Thr Asp Gly Gin lie Leu Tyr 

50 55 60 

His Gly Gin Asn Val Leu Glu Lys Gly Tyr Asp Leu Thr Gin Tyr Arg 

70 " 80 

Glu Lys Leu Gly Met Val Phe Gin Ser Phe Asn Leu Phe Glu Asn Leu 

85 90 95 

Asn val Leu Glu Asn Thr lie Val Ala Gin Thr Thr Val Leu Lys Arg 

105 no 

Glu Arg Thr Glu Ala Glu Glu lie Ala Lys Glu Asn Leu Glu Lys Val 

115 120 125 

Gly Met Gly Glu Arg Tyr Trp Gin Leu Pro Asn Gin lie Ala Thr lie 

130 "5 14Q 

Arg Trp Ser Lys Thr Thr Cys Gly His Arg Ser Cys Pro He Asn Glu 

150 "5 160 

Ser Arg His Leu Cys Ser Leu Met Asn Gin Leu Leu Pro Leu Thr Leu 



165 

Arg Trp Leu Glu Lys 
180 



170 



175 



(2) INFORMATION FOR SEQ ID NO: 318: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 85 amino acids 

(B) TYPE: amino acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 318 : 



Met Leu Phe Asp Glu Pro Thr Ser 

1 5 
Glu Val He Asn Val Met Lys Glu 
20 

He He Val Thr His Glu Met Gly 

35 40 
Val He Phe Thr Ala Asp Gly Glu 

50 55 
Gin He Phe Asp Asn Pro Gin His 
65 70 
Lys Val Leu Asn Val 
85 



Ala Leu Asp Pro Glu Met Val Gly 

10 15 
Leu Ala Glu Gin Gly Met Thr Met 
25 30 
Phe Ala Arg Gin Val Ala Asn Arg 
45 

Phe Leu Glu Asp Gly Thr Pro Asp 
60 

Pro Arg Leu Lys Glu Phe Leu Asp 
75 80 



(2) INFORMATION FOR SEQ ID NO: 319: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 332 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 319 : 

Met Ala Lys Asp He Leu Val Gin Ala Met Gin Arg He Ala Gly Glu 

1 5 io 15 

Tyr Val Ala Glu Ser Thr Asn Ser Thr Val His Leu Pro Asp Asp Thr 

20 25 30 

Met Lys Gly Arg He lie Gly Arg Glu Gly Arg Asn He Arg Thr Phe 

35 40 45 

Glu Ser Leu Thr Gly Val Asp Val He He Asp Asp Thr Pro Glu Val 

50 55 60 

Val Thr Leu Ser Gly Phe Asp Pro He Arg Arg Glu He Ala Arg Met 
65 70 75 80 
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Thr Met Glu Met Leu Leu Lys Asp Gly Arg He His Pro Ala Arg He 

85 90 95 

Glu Glu Leu Val Glu Lys Asn Arg Gin Glu He Asp Asn Lys He Arg 

100 105 no 

Glu Tyr Gly Glu Ala Ala Ala Tyr Glu He Gly Ala Pro Asn Leu His 

115 120 125 

Pro Asp Leu Met Lys He Met Gly Arg Leu Gin Phe Arg Thr Ser Tyr 

130 135 i 4 o 

Gly Gin Asn Val Leu Arg His Ser He Glu Val Ala Lys Leu Ala Gly 
145 "0 155 160 

He Met Ala Ser Glu Leu Gly Glu Asn Ala Ala Leu Ala Arg Arg Ala 

165 170 i7 5 

Gly Phe Leu His Asp He Gly Lys Ala He Asp His Glu Val Glu Gly 

180 185 190 

Ser His Val Glu He Gly Met Glu Leu Ala Arg Lys Tyr Lys Glu Pro 

195 200 205 

Pro Val Val Val Asn Thr He Ala Ser His His Gly Asp Val Glu Ala 

210 215 220 

Glu Ser Val He Ala Val He Val Ala Ala Ala Asp Ala Leu Ser Ala 
225 230 235 240 

Ala Arg Pro Gly Ala Arg Ser Glu Ser Leu Glu Ser Tyr He Lys Arg 

245 250 255 

Leu His Asp Leu Glu Glu He Ala Asn Gly Phe Glu Gly Val Gin Thr 

260 265 270 

Ser Phe Ala Leu Gin Ala Gly Arg Glu He Arg He Met Val Asn Pro 

275 280 285 

Gly Lys He Lys Asp Asp Lys Val Thr He Leu Ala His Lys Val Arg 

290 295 300 

Lys Lys He Glu Asn Asn Leu Asp Tyr Pro Gly Asn He Lys Val Thr 
305 310 315 320 

Val He Arg Glu Leu Arg Ala Val Asp Tyr Ala Lys 
325 33 0 



(2) INFORMATION FOR SEQ ID NO: 320: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 123 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Protein 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:320: 

Met Lys Ser Ser Gin Glu Ala Ala Glu Leu Met Leu Leu Asn Ala Glu 

15 10 15 

Gin Glu Ala Thr Asn Leu Arg Gly Gin Ala Glu Arg Glu Ala Asp Leu 

20 25 30 

Leu Val Asn Glu Ala Lys Arg Glu Ser Lys Ser Leu Lys Lys Glu Ala 

35 40 45 

Leu Leu Glu Ala Lys Glu Glu Ala Arg Lys Tyr Arg Glu Glu Val Asp 

50 55 60 

Ala Glu Phe Lys Ser Glu Arg Gin Glu Leu Lys Gin He Glu Ser Arg 
65 70 75 80 

Leu Thr Glu Arg Ala Thr Ser Leu Asp Arg Lys Asp Asp Asn Leu Thr 

85 90 95 

Ser Lys Glu Gin Thr Leu Glu Gin Lys Arg Thr Lys Tyr Phe Leu He 

100 105 110 

Glu Arg Lys Thr Leu Met Arg Val Lys Ser Asn 
115 120 

(2) INFORMATION FOR SEQ ID NO:321: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 122 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDKESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 321: 

Met Tyr Asp Phe Lys Gly Lys Lys Phe Leu Gin He Leu He He He 

15 10 15 

Ala Ser Met Ser Thr Pro Phe Val Gly Ala Tyr Ser Trp He Leu Leu 

20 25 3 o 

Leu Gly Arg Asn Glu Val He Thr Lys Phe Leu Thr Asn Ala Leu Tyr 

35 40 45 

Leu Pro Ala He Asp He Tyr Gly Phe Lys Gly He He Leu Val Phe 

50 55 60 

Thr Leu Gin Leu Phe Pro Leu Val Phe Leu Tyr Val Ala Gly Thr Met 
65 70 75 80 

Asn Ser He Asp Asn Ser Leu Leu Glu Ala Ala Glu Ser Met Gly Ser 
85 90 95 
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Phe Gly Phe Lys Pro He Val Thr Val Val Leu Pro Leu Leu Val Pro 

100 105 110 

Thr Leu Leu Ala Ala Pro Cys Leu Tyr Leu 
115 120 

(2) INFORMATION FOR SEQ ID NO: 322 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 160 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 322: 

Met Arg Ala Phe Ser Asp Phe Gly Thr Pro Met Leu He Gly Glu Gly 

15 10 is 

Tyr Arg Thr Phe Pro Val Leu lie Tyr Thr Gin Phe lie Ser Glu Val 

20 25 30 

Gly Gly Asn Ser Ala Phe Ala He Met Ala He lie He Ala Leu Ala 

35 40 45 

He Phe Leu He Gin Lys His He Ala Asn Arg Tyr Ser Phe Ser Met 
50 55 60 



Asn Leu Leu His Pro He Glu Pro Lys Lys Thr Thr Lys Gly Lys Met 
" 70 75 80 

Ala Ala lie Tyr Ala Thr Val Tyr Gly He He Phe He Ser Val Leu 



85 9o 



95 

Pro Gin He Tyr Leu He Tyr Thr Ser Phe Leu Lys Thr Ser Gly Met 

100 105 110 

val Phe Val Lys Gly Tyr Ser Pro Asn Ser Tyr Lys Val Ala Phe Asn 

H5 120 125 

Arg Met Gly Ser Ala He Phe Asn Thr He Arg He Pro Leu He Ala 

130 "5 140 

Leu Val Leu Val Val Pro He Tyr Asp He Tyr Leu Leu Pro Ser Arg 
" 5 150 155 160 

(2) INFORMATION FOR SEQ ID NO: 323: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 146 amino acids 

(B) TYPE: amino acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 323: 

Met Glu Ser Asn Ser Ser Leu Thr Leu Leu Lys His Gly Ser Glu Val 

15 10 15 

Asn Leu Glu Arg Ala Leu Ser Val Asn Gly Arg Leu Gly Gly His Val 

20 25 30 

Val Thr Gly His lie Asp Gly Thr Gly Lys He Ser Ser He Lys Lys 

35 40 45 

Asp Asp Asn Ala Val Trp Tyr Gin He Asn Thr Gin Lys Glu He Leu 

50 55 60 

Asp Leu He Val Glu Lys Gly Ser He Thr He Asp Gly He Ser Leu 
65 70 75 80 

Thr Val Ala Lys Val Ser Lys Val Asn Phe Ser Val Ser Val He Pro 

85 90 95 

His Thr Leu Lys Gin Thr He Leu Lys Ser Lys Gin Val Gly Ser Thr 

100 105 no 

Val Asn Leu Glu Asn Asp He Leu Gly Lys Tyr Val Gin Lys Leu Met 

115 120 125 

Asp Asn Ser Pro Lys Ser Glu He Ser Lys Glu Leu Leu Tyr Gin Asn 

130 135 140 

Gly Phe 
145 

(2) INFORMATION FOR SEQ ID NO: 324 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 288 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 324 : 

Met Ser Arg Gin Thr Pro Ser Leu Ser Phe Glu Val Phe Pro Pro Asn 
1 5 io 15 
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Pro Ala Val Gly Asn Asp Asn lie He Ser Ala Leu Gin Asp Met Gin 

20 25 3p 

Glu Leu Ala Pro His Phe He Ser Val Thr Ala Ser Asn Asn Lys Phe 

35 40 45 

Asn He Lys Glu Thr Thr Val Arg Leu Ala Asp Phe He Gin Asn Asp 

50 55 60 

Leu Ala He Pro Thr He Ala His Leu Pro Ala He Tyr Leu Thr Lys 
65 ™ 75 80 

Asp Lys Val Ala Glu Thr He Ala Asp Leu Asp Lys Val Gly Val Gin 

85 90 95 

Lys lie Leu Ala Leu Arg Gly Asp He He Pro Asp Val Glu Pro Gin 

100 105 no 

Lys Asp Phe Arg Tyr Ala Thr Asp Leu He Glu Phe He Lys Glu Gin 

115 120 125 

Thr Pro His Phe Asp He He Gly Ala Cys Tyr Pro Glu Gly His Pro 

"0 135 140 

Asp Ser Pro Asn Gin He Ser Asp He Gin Asn Leu Lys Lys Lys Val 
145 150 155 160 

Asp Ala Gly Cys Ser Ser Leu Val Thr Gin Leu Phe Phe Asp Asn Glu 

165 170 i7 5 

Arg Phe Tyr Asp Phe Gin Asp Lys Cys He Leu Ala Gly He Asp Val 

180 185 190 

Pro He His Ala Gly He Met Pro He Leu Asn Arg Asn Gin Ala Leu 

195 200 205 

Arg Leu Leu Lys Thr Cys Glu Asn He His Leu Pro Arg Lys Phe Lys 

210 215 220 

Ala He Leu Asp Lys Tyr Glu His Asp Pro Glu Ser Leu Arg Ala Ala 
225 230 235 24Q 

Gly Leu Ala Tyr Ala Val Asp Gin He Val Asp Leu Val Thr Gin Asp 

245 250 255 

Val Ala Gly Val His Leu Tyr Thr Met Asn Asn Ala Asp Thr Ala Lys 

260 265 270 

Tyr He His Gin Ala Thr His Ala Leu Phe Asn His Gin Ser Leu Gly 
275 280 285 

(2) INFORMATION FOR SEQ ID NO: 3 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 252 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 325: 

Leu Glu Glu Leu Ala Leu Trp Phe Glu Lys Leu His Asp Phe Pro Gin 

15 10 15 

Leu Gin Gly Asn Leu Gin Ala Phe Asn Asp Ala Gly Phe He Glu Asn 

20 25 30 

Phe Ala Ser Glu Glu Leu Ala Arg He Arg Arg Lys He His Asp Ser 

35 40 45 

Glu Ser Gin Val Arg Asp Val Leu Gin Asp Leu Leu Lys Gin Lys Ala 

50 55 60 

Gin Met Leu Thr Glu Gly He Val Ala Ser Arg Asn Gly Arg Gin Val 
65 70 75 80 

Leu Pro Val Lys Asn Thr Tyr Arg Asn Lys He Ala Gly Val Val His 

85 90 95 

Asp He Ser Ala Ser Gly Asn Thr Val Tyr He Glu Pro Arg Glu Val 

1°0 105 no 

Val Lys Leu Ser Glu Glu He Ala Ser Leu Arg Ala Asp Glu Arg Tyr 

115 120 X25 

Glu Met Leu Arg He Leu Gin Glu He Ser Glu Arg Val Arg Pro His 

"0 135 140 

Ala Ala Glu He Ala Asn Asp Ala Trp He He Gly His Leu Asp Leu 
145 150 155 16Q 

He Arg Ala Lys Val Arg Phe He Gin Glu Arg Gin Ala Val Val Pro 

165 170 175 

Gin Leu Ser Glu Asn Gin Glu He Gin Leu Leu His Val Cys His Pro 

180 185 190 

Leu Val Lys Asn Ala Val Ala Asn Asp Val Tyr Phe Gly Gin Asp Leu 

195 200 205 

Thr Gly Tyr Cys His Tyr Arg Ser Gin Tyr Arg Trp Glu Asp His His 

21° 215 220 

Ala Gin Asn Ser Gly Leu Asp Thr Gly His Gly Pro Val Arg He Ala 
225 230 235 240 

Asp Phe Ser Arg Gin Gly Lys Ser Cys Trp Tyr Phe 
245 250 



(2) INFORMATION FOR SEQ ID NO: 32 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 106 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: Protein 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:326: 

Met Gly Ala Tyr Tyr Leu Val Met Gin Ser Leu Ser Tyr Leu Glu Tyr 

1 5 10 15 

Glu Gin Gly lie oln Ser Thr Thr Val Arg His Leu lie Leu Val Phe 

20 25 30 

Tyr Leu Leu Phe Phe Met Gly Gly lie Lys Lys Leu Asp tL Tyr Leu 

35 40 45 

Lys Glu Lys Leu Gin Glu Glu Leu Asn Gin Glu Gin Thr Leu Arg Tyr 

60 

Arg Asp Met Glu Arg Tyr Ser Arg His He Glu Glu Leu Tyr Lys Glu 

*? 70 "» 80 

lie Arg Ser Phe Arg His Asp Tyr Thr Asn Leu Leu Thr Ser Tyr Val 



85 90 



95 



Trp Ala Leu Lys Arg Arg He Trp Ser Arg 
100 105 

(2) INFORMATION FOR SEQ ID NO:327: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 108 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO.-327: 

Met ser Leu Leu Asp Phe Leu Thr He Val Ser lie Leu Cys Asp Asn 

10 15 
Ala lie Lys Ala Ser Ala G!u Ala Ser Gin Pro His Val Ser lie Ala 



20 



25 



30 



Phe Leu Lys Asn Gly Ala Gin Glu Thr Phe He lie Glu Asn Ser lie 

Lys Glu Glu Gly He Asp lie Ser Glu He Phe Ser 11 Gly Ala Ser 

50 *5 60 

Ser Lys Gly Glu Glu Arg Gly Val Gly Leu Tyr Thr Val 



65 



70 



Met Lys He 
">5 80 
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Val Glu Ser His Pro Asn Thr Asn Leu Asn Thr Thr Cys Gin Asn Gin 

85 90 95 

Val Phe Arg Gin Val Leu Thr Val He His Ala Glu 



(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 328: 

Leu He Ser Leu Gly Asn Leu Lys Met Asn His Ser Ser His Glu Val 

1 5 10 15 

Gin He Gly Glu Lys Met Leu Asp Leu Thr Val Lys Ser Phe Glu Leu 

20 25 30 

Leu Trp He Leu Ala Ser Asn Pro Glu Arg Val Phe Ser Lys Thr Asp 

35 40 45 

Leu Tyr Glu Lys He Trp Lys Glu Asp Tyr Val Asp Asp Thr Asn Thr 

50 55 60 

Leu Asn Val His He His Ala Leu Arg Gin Glu Leu Ala Lys Tyr Ser 
65 70 75 80 

Ser Asp Gin Thr Pro Thr He Lys Thr Val Trp Gly Leu Gly Tyr Lys 

85 90 95 

He Glu Lys Pro Arg Gly Gin Thr 
100 

(2) INFORMATION FOR SEQ ID NO: 329: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 59 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 



100 



105 



(2) INFORMATION FOR SEQ ID NO: 328: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 104 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 329: 
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Met Ala Gin Thr Ser Lys Thr Thr Val Ser Phe Tyr Leu Asn Gly Lys 

IS 10 15 

Tyr Glu Lys Met Ser Gin Glu Thr Arg Glu Lys He Glu Lys Val He 

20 25 30 

His Glu Thr Asn Tyr Lys Pro Ser He Val Ala Arg Ser Leu Lys Leu 

35 40 45 

Gin Thr Asn Lys He Asn Arg Cys Phe Asp Arg 
50 55 



(2) INFORMATION FOR SEQ ID NO: 330: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 186 amino acids 

(B) TYPE: amino acid 

(C) STRANOEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 330: 

Met His Tyr He Glu Gly Ala Tyr Trp Ser Ala Glu Pro Asp Leu Pro 

1 5 io 15 

Tyr Leu His Lys Arg Lys He Leu Met Leu Val Glu Asp Val Trp Leu 

20 25 30 

Leu Val Asp Asp He Arg Cys Gin Gly Gin His Glu Ala Leu Thr Gin 

3 5 40 45 

Phe He Leu Asp Lys Asp Val Thr Tyr Gin Asp Gly Lys He Asn Gin 

50 55 60 

Leu Arg Leu Trp Ser Glu Val Asp Phe Asp Leu Glu Asp Thr He He 
65 70 75 80 

Ser Pro Lys Tyr Asn Glu Leu Glu Arg Ser Ser Lys Leu Thr Lys Arg 

85 90 95 

Gin Phe Phe Glu Asn Gin Met Leu Asp Tyr Thr He He Ala His Glu 

100 105 no 

Ser Phe Glu He He Arg His Ser Val Tyr Gin Thr Asp Asp Arg Glu 

115 120 125 

Val Glu Asn Ala Leu Ala Phe Glu Val Lys Asn Asp Glu Thr Asp Lys 

130 135 140 

Leu He Leu Leu Leu Ser Glu Asp He Gly Val Gly Glu Lys Leu Cys 
145 150 155 160 
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Leu Val Asp Gly Thr Lys Met Arg Gly Lys Cys Leu Val Tyr Asp Lys 

165 170 175 

lie Asn Glu Arg Met lie Arg Leu Gin Cys 



(2) INFORMATION FOR SEQ ID NO: 331; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 286 amino acids 

(B) TYPE: amino acid 

(C) STRANDEONESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 331: 

Met Asn Cys Glu Ala Val Ala Leu Gly Ser Phe Cys Glu Leu Lys Ser 

15 10 15 

Arg Arg Gly Lys Lys Gin Lys Gly Glu lie Leu Met Ala Val He Ser 

20 25 30 

Met Lys Gin Leu Leu Glu Ala Gly Val His Phe Gly His Gin Thr Arg 

35 40 45 

Arg Trp Asn Pro Lys Met Ala Lys Tyr He Phe Thr Glu Arg Asn Gly 

50 55 60 

He His Val He Asp Leu Gin Gin Thr Val Lys Tyr Ala Asp Gin Ala 
65 70 75 80 

Tyr Xaa Phe Met Arg Asp Ala Ala Ala Asn Asp Ala Val Val Leu Phe 

85 90 95 

Val Gly Thr Lys Lys Gin Ala Ala Asp Ala Val Ala Glu Glu Ala Val 

100 105 no 

Arg Ser Gly Gin Tyr Phe He Asn His Arg Trp Leu Gly Gly Thr Leu 

115 120 125 

Thr Asn Trp Gly Thr He Gin Lys Arg He Ala Arg Leu Lys Glu He 

130 135 140 

Lys Arg Met Glu Glu Asp Gly Thr Phe Glu Val Leu Pro Lys Lys Glu 
145 150 155 160 

Val Ala Leu Leu Asn Lys Gin Arg Ala Arg Leu Glu Lys Phe Leu Gly 

165 170 175 

Gly He Glu Asp Met Pro Arg He Pro Asp Val Met Tyr Val Val Asp 

180 185 190 

Pro His Lys Glu Gin He Ala Val Lys Glu Ala Lys Lys Leu Gly He 
195 200 205 



180 



185 
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Pro Val Val Ala Met Val Asp Thr Asn Thr Asp Pro Asp Asp lie Asp 

210 215 220 

Val lie He Pro Ala Asn Asp Asp Ala He Arg Ala Val Lys Leu He 
225 230 235 240 

Thr Ala Lys Leu Ala Asp Ala He He Glu Gly Arg Gin Gly Glu Asp 

245 250 255 

Ala Val Ala Val Glu Ala Glu Phe Ala Ala Pro Glu Thr Gin Ala Asp 

260 2 65 270 

Ser He Glu Glu He Val Glu Val Val Glu Gly Asp Asn Ala 
275 280 285 

(2) INFORMATION FOR SEQ ID NO: 332: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 202 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 332: 

Met Lys Gin Leu He Ser Leu Lys Asn He Phe Arg Ser Tyr Arg Asn 

Gly Asp Gin Glu Leu Gin Val Leu Lys Asn He Asn Leu Glu Val Asn 

20 " 30 

Glu Gly Glu Phe Val Ala lie Met Gly Pro Ser Gly Ser Gly Lys Ser 

35 40 45 

Thr Leu Met Asn Thr He Gly Met Leu Asp Thr Pro Thr Ser Gly Glu 

50 " 60 

Tyr Tyr Leu Glu Gly Gin Glu val Ala Gly Leu Gly Glu Lys Gin Leu 
65 70 75 eo 

Ala Lys Val Arg Asn Gin Gin He Gly Phe Val Phe Gin Gin Phe Phe 

85 90 95 

Leu Leu Ser Lys Leu Asn Ala Leu Gin Asn Val Lys Leu Pro Leu lie 

100 105 no 

Tyr Ala Gly Val Ser Ser Ser Lys Arg Arg Lys Leu Ala Glu Glu Tyr 

115 "0 125 

Leu Asp Lys Val Glu Leu He Glu Arg Ser His His Leu Pro Ser Glu 

«0 135 14Q 

Leu Ser Gly Gly Gin Lys Gin Arg Val Ala He Ala Arg Ala Leu Val 

145 150 155 160 
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Asn Asn Pro Ser He He Leu Ala Asp Glu Pro Thr Gly Ala Leu Asp 

165 170 175 

Thr Lys Thr Gly Asn Gin He Met Gin Leu Leu Val Asp Leu Asn Lys 

180 185 190 

Glu Arg Lys Asn His Tyr His Gly Asn Ala 
195 200 



<2) INFORMATION FOR SEQ ID NO: 333: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 164 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: Protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 333: 

Met Thr Ser Val He Lys Trp Thr Leu Arg Lys Leu Phe Gin Ser Gin 

1 5 io 15 

Pro Phe Arg Gly Asn Asn Val Ser Arg Leu Val Asp He Leu Ser Glu 

20 25 30 

Asn Leu Asp Glu Gly Phe Gin Tyr Phe Pro Ser Asp Gin He Thr Asp 

35 40 45 

His Pro Glu Arg Phe Leu Val Ser Glu Met Val Arg Glu Lys Val Leu 

50 55 60 

His Leu Thr Arg Glu Glu He Pro His Ser Val Ala Val Val Val Asp 
65 70 75 80 

Ser Met Lys Arg Asp Glu Glu Thr Asp Lys Val His He Arg Ala Thr 

85 90 95 

He Met Val Glu Arg Asp Ser Gin Lys Gly He He He Gly Lys Gly 

100 105 no 

Gly Ala Met Leu Lys Lys He Gly Ser Met Ala Arg Arg Asp He Glu 

H5 120 125 

Leu Met Leu Gly Asp Lys Val Phe Leu Glu Thr Trp Val Lys Val Lys 

!3° 135 140 

Lys Asn Trp Arg Asp Lys Lys Leu Asp Leu Ala Asp Leu Gly Tyr Asn 
145 150 155 160 

Glu Arg Glu Tyr 



(2) INFORMATION FOR SEQ ID NO: 334: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 156 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 334: 

Met Thr Phe Lys Ser Gly Phe Val Ala lie Leu Gly Arg Pro Asn Val 

1 5 10 15 

Gly Lys ser Thr Phe Leu Asn His Val Met Gly Gin Lys lie Ala He 

20 25 30 

Met Ser Asp Lys Ala Gin Thr Thr Arg Asn Lys He Met Gly lie Tyr 

35 40 45 

Thr Thr Asp Lys Glu Gin He Val Phe lie Asp Thr Pro Gly lie His 

50 55 60 

Lys Pro Lys Thr Ala Leu Gly Asp Phe Met Val Glu Ser Ala Tyr Ser 
'* 7 ° 75 bo 

Thr Leu Arg Glu Val Asp Thr Val Leu Phe Met Val Pro Ala Asp Glu 

85 90 95 

Ser Arg Gly Lys Gly Asp Asp Met lie lie Glu Arg Leu Lys Ala Ala 

100 los 110 

Lys Val Pro Val lie Leu Val Val Asn Lys He Asp Lys Val His Pro 

H5 120 125 

Asp Gin Leu Leu Ser Gin He Asp Asp Phe Arg Asn Gin Met Asp Phe 

130 135 140 

Lys Glu He Val Pro He Ser Ala Leu Gin Gly Lys 
145 150 155 

(2) INFORMATION FOR SEQ ID NO: 335: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 152 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 



(ii) MOLECULE TYPE; Protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 335: 
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Met Thr Glu He Arg Leu Glu His Val Ser Tyr Ala Tyr Gly Gin Glu 

1 5 io is 

Arg He Leu Glu Asp He Asn Leu Gin Val Thr Ser Gly Glu Val Val 

20 25 30 

Ser He Leu Gly Pro Ser Gly Val Gly Lys Thr Thr Leu Phe Asn Leu 

35 40 45 

He Ala Gly He Leu Glu Val Gin Ser Gly Arg He Val Leu Asp Gly 

50 55 60 

Glu Glu Asn Pro Lys Gly His Val Ser Tyr Met Leu Gin Lys Asp Leu 
65 7 ° 75 bo 

Leu Leu Glu His Lys Thr Val Leu Gly Asn He He Leu Pro Leu Leu 

85 90 95 

He Gin Lys Val Asp Lys Ala Glu Ala lie Ser Arg Ala Asp Lys He 

100 105 no 

Leu Ala Thr Phe Gin Leu Thr Ala Val Arg Asp Lys Tyr Pro His Glu 

115 120 125 

Leu Ser Gly Gly Met Arg Gin Arg Val Ala Leu Leu Arg Thr Tyr Leu 

130 135 140 

Phe Gly His Lys Asn Arg Asn Pro 
145 . iso 

(2) INFORMATION FOR SEQ ID NO: 336: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 81 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 336: 

Met Thr Lys Met Glu Leu His Ala Trp Tyr Leu Glu He His Lys Gin 

1 5 10 15 

Leu Gin Leu Thr Thr Leu He Xaa Thr His Ser He Glu Glu Ala Leu 

20 25 30 

Xaa Leu Ser Asp Arg He Tyr He Leu Xaa Asn Xaa Pro Gly Gin He 

35 40 45 

Val Ser Glu He Lys Leu Asp Trp Ser Glu Asp Glu Asp Xaa Glu Val 
50 55 60 
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Xaa Lys lie Ala Xaa Lys Arg Gin He Leu Ala Glu Leu Gly Leu Asp 

65 70 75 80 

Lys 



(2) INFORMATION FOR SEQ ID NO: 337 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 266 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:337: 

Met Arg Cys. Lys Met Leu Asp Pro lie Ala lie Gin Leu Gly Pro Leu 

" 5 " 15 

Ala He Arg Trp Tyr Ala Leu Cys lie Val Thr Gly Leu lie Leu Ala 

20 25 30 

Val Tyr Leu Thr Met Lys Glu Ala Pro Arg Lys Lys He lie Pro Asp 

35 40 45 

Asp lie Leu Asp Phe lie Leu Val Ala Phe Pro Leu Ala lie Leu Gly 

50 " 60 

Ala Arg Leu Tyr Tyr Val lie Phe Arg Phe Asp Tyr Tyr Ser Gin Asn 
" 70 75 80 

Leu Gly Glu lie Phe Ala lie Trp Asn Gly Gly Leu Ala lie Tyr Gly 



85 90 95 

Gly Leu lie Thr Gly Ala Leu Val Leu Tyr He Phe Ala Asp Arg Lys 

100 105 110 

Leu lie Asn Thr Trp Asp Phe Leu Asp lie Ala Ala Pro Ser Val Met 

115 12 ° 125 

He Ala Gin Ser Leu Gly Arg Trp Gly Asn Phe Phe Asn Gin Glu Ala 

130 "5 140 

ryr Gly Ala Thr Val Asp Asn Leu Asp Tyr Leu Pro Gly Phe lie Arg 

" 5 150 155 160 

Asp Gin Met Tyr lie Glu Gly Ser Tyr Arg Gin Pro Thr Phe Leu Tyr 

1«5 170 1?5 

Glu Ser Leu Trp Asn Leu Leu Gly Phe Ala Leu lie Leu lie Phe Arg 

«0 185 19Q 

Arg Lys Trp Lys Ser Leu Arg Arg Gly His lie Thr Ala Phe Tyr Leu 
195 200 205 
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He Trp Tyr Gly Phe Gly Arg Met Val He Glu Gly Met Arg Thr Asp 

210 215 220 

Ser Leu Met Phe Phe Gly Leu Arg Val Ser Gin Trp Leu Ser Val Val 
225 230 235 240 

Phe He Gly Leu Gly He Met He Val He Tyr Gin Asn Arg Lys Lys 

245 250 255 

Ala Pro Tyr Tyr He Thr Glu Glu Glu Asn 



(2) INFORMATION FOR SEQ ID NO: 338: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 243 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS ; single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 338: 

Met Ser Lys He lie Asp Arg Tyr Arg Pro Ala Gly Glu Gly Phe Val 

1 5 io is 

Arg He Asp Thr Gin Asn Asn Met Pro Thr Ala Ala Gly Leu Ser Ser 

20 25 30 

Ser Ser Ser Gly Leu Ser Ala Leu Val Lys Ala Cys Asn Ala Tyr Phe 

35 40 45 

Lys Leu Gly Leu Asp Arg Ser Gin Leu Ala Gin Glu Ala Lys Phe Ala 

50 55 60 

Ser Gly Ser Ser Ser Arg Ser Phe Tyr Gly Pro Leu Gly Ala Trp Asp 
65 70 75 80 

Lys Asp Ser Gly Glu He Tyr Pro Val Glu Thr Asp Leu Lys Leu Ala 

85 90 95 

Met He Met Leu Val Leu Glu Asp Lys Lys Lys Pro He Ser Ser Arg 

100 105 no 

Asp Gly Met Lys Leu Cys Val Glu Thr Ser Thr Thr Phe Asp Asp Trp 

115 120 125 

Val Arg Gin Ser Glu Lys Asp Tyr Gin Asp Met Leu He Tyr Leu Lys 

130 135 140 

Glu Asn Asp Phe Ala Lys He Gly Glu Leu Thr Glu Lys Asn Ala Leu 
14 5 150 155 160 

Ala Met His Ala Thr Thr Lys Thr Ala Ser Pro Ala Phe Ser Tyr Leu 



260 



265 



165 



170 



175 
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Thr Asp Ala Ser Tyr Glu Ala Met Ala Phe Val Arg Gin Leu Arg Glu 

180 185 190 

Lys Gly Glu Ala Cys Tyr Phe Thr Met Asp Ala Gly Pro Asn Val Lys 

195 200 205 

Val Phe Cys Gin Glu Lys Asp Leu Glu His Leu Ser Glu lie Phe Gly 

210 215 220 

Gin Arg Tyr Arg Leu He Val Ser Lys Thr Lys Asp Leu Ser Gin Asp 
225 230 235 

Asp Cys Cys 



240 



(2) INFORMATION FOR SEQ ID NO: 339: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 191 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 339: 

Met His Asn Gly Phe Val Asn He Asp Asn Val Lys Met Ser Lys Ser 

1 5 10 15 

Leu Gly Asn Phe He Thr Val His Asp Ala Leu Lys Thr Leu Asp Gly 

20 25 30 

Gin Val Leu Arg Phe Phe Phe Ala Thr Gin His Tyr Arg Lys Pro He 

35 40 45 

Asn Phe Thr Glu Lys Ala Val Arg Asp Ala Glu Thr Asn Leu Lys Tyr 

50 55 60 

Leu Lys Asn Thr Tyr Glu Gin Pro Phe Thr Gly Asn Val Asp Ala Gin 
65 70 75 80 

Glu Leu Gin Asn Phe Lys Asp Lys Phe Val Ala Ala Met Asp Glu Asp 

85 90 95 

Phe Asn Ala Ala Asn Gly He Thr Val Val Phe Glu Met Ala Lys Trp 

100 105 no 

He Asn Ser Gly Asn Tyr Asp Ala Ser Val Lys Gin Ala Leu Ala Asp 

115 120 125 

Met Leu Glu He Phe Gly He Val Phe Val Glu Glu Val Leu Asp Ala 

130 135 i 4 o 

Glu He Glu Asp Leu He Gin Lys Arg Gin Glu Ala Arg Ala Asn Arg 
145 150 155 160 
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Asp Phe Glu Thr Ala Asp Gin lie Arg Asp Gin Leu Val Thr Gin Gly 

165 170 175 

lie Lys Leu Leu Asp Thr Lys Asp Gly Val Arg Trp Thr Arg Asp 



(2) INFORMATION FOR SEQ ID NO: 340: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 248 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 340: 

Met Lys Arg Thr Gly Leu Phe Thr Lys He Phe He Tyr Thr Phe Ser 

15 10 15 

He Phe Ser Val Leu Val He Cys Leu His Leu Ala He Tyr Phe Leu 

20 25 30 

Phe Pro Ser Thr Tyr Leu Ser His Arg Gin Glu Thr He Gly Gin Lys 

35 40 45 

Ala Thr Ala He Ala Gin Ser Leu Glu Gly Lys Asp Arg Gin Ser He 

50 55 60 

Glu Gin Val Leu Asp Leu Tyr Ser Gin Thr Ser Asp He Lys Gly Thr 
65 70 75 80 

Val Lys Gly Glu Met Thr Glu Asp Lys Leu Glu Val Lys Asp Ser Leu 

85 90 95 

Pro Leu Asp Thr Asp Arg Gin Thr Thr Ser Leu Phe He Glu Glu Arg 

100 105 no 

Glu Val Lys Thr Gin Asp Gly Gly Thr Met He Leu Gin Phe Leu Ala 

115 120 125 

Ser Met Asp Leu Gin Lys Glu Ala Glu Gin He Ser Leu Gin Phe Leu 

130 135 140 

Pro Tyr Thr Leu Leu Ala Ser Phe Leu He Ser Leu Leu Val Ala Tyr 
145 150 155 160 

He Tyr Ala Arg Thr lie Val Ala Pro He Leu Glu He Lys Arg Val 

165 170 175 

Thr Arg Arg Met Met Asp Leu Asp Ser Gin Val Arg Leu Arg Val Asp 

180 185 190 

Ser Lys Asp Glu He Gly Asn Leu Lys Glu Gin He Asn Ser Leu Tyr 



180 



185 



190 



195 



200 



205 
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Gin His Leu Leu Thr Val lie Ala Asp Leu His Glu Lys Asn Glu Ala 

210 215 220 

lie Leu Gin Leu Glu Lys Met Lys Val Glu Phe Leu Gin Gly Ala Ser 
225 230 235 240 

His Glu Leu Lys His Thr Ala Gly 
245 

(2) INFORMATION FOR SEQ ID NO: 341: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 239 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 341: 

Leu Leu Glu Ser Tyr Ser Arg Glu Gin Val Asn Phe Phe Val Asp Gin 

Gly Ala Val Thr He Val Gin Lys Glu Val Arg Arg Ser Ala Ala Tyr 

20 25 30 

Phe Glu Gly He Glu Ala Ser Arg Pro Leu Glu Leu Asn Pro Glu Gin 

35 40 45 

Arg Gin Ala Arg Asp Ala Val Val Ser Ser He Gly Ser Ser Gin Pro 

50 55 60 

Pro Phe Leu Leu Gin Gly He Thr Gly Ser Gly Lys Thr Glu Val Tyr 
" 70 75 so 

Leu Gin He He Gin Gly Ala Leu Asp Lys Gly Lys Thr Ala He Leu 

85 90 95 

Leu Val Pro Glu lie Ser Leu Thr Pro Gin Met Thr Glu Arg Phe He 

100 105 110 

Ala Arg Phe Gly Asp Lys Val Ala He Leu His Ser Gly Leu Ser Asn 

US 120 125 

Gly Glu Lys Tyr Asp Glu Trp Arg Lys Val Glu Arg Giy Asp Ala Gin 

130 135 140 

Val val val Gly Ala Arg Ser Ala He Phe Ala Pro Leu Lys Asn Leu 
145 150 155 160 

Gly Val Met He He Asp Glu Glu His Glu Ala Ala Tyr Lys Gin Asp 

165 170 175 

Ser Asn Pro Arg Tyr His Ala Arg Glu Val Ala He Leu Arg Ala Arg 
180 185 190 
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Tyr Asn Gin Ala Thr Leu Val Leu Gly Ser Ala Thr Pro Ser Leu Glu 

195 200 205 

Ser Arg Ala Arg Ala Gly Lys Gly Val Tyr Gin His Leu Arg Leu Thr 

210 215 220 

Gin Arg Ala Asn Pro Leu Ala Thr lie Pro Glu Val Ser Ser Asp 
225 230 235 

(2) INFORMATION FOR SEQ ID NO: 342: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 94 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 342: 

Met Cys Met Val Met Gin Leu Leu Gly His Gin Leu Val Leu He Gly 

15 10 is 

Phe Lys Gly Ser Arg Lys Cys Thr Pro Phe Ala Ala Gin Met Ala Ser 

20 25 30 

Glu Ala Ala Ala Lys Ser Ala Gin Glu His Gly Leu Lys Ser Val Glu 

35 40 45 

Val Thr Val Lys Gly Pro Gly Ser Gly Arg Glu Ser Ala He Arg Ala 

50 55 60 

Leu Ala Ala Ala Gly Leu Glu Val Thr Ala lie Arg Asp Val Thr Pro 
65 70 75 80 

Val Pro His Asn Gly Ala Arg Pro Pro Lys Arg Arg Arg Val 
85 90 

(2) INFORMATION FOR SEQ ID NO: 343: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 61 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 343: 
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Met He Ser Leu Pro Glu Asn Leu Phe Ala Asn Ala Lys Gin Ser Lys 

15 10 15 

Thr He Phe He Leu Gin Lys Lys Ser Glu He Ala Val Glu Pro Phe 

20 25 30 

Val Tyr Pro Leu Ala Ser Leu Gin Asp Ala Ser Val Leu Met Lys Phe 

35 40 45 

Lys Glu Asn Phe Gin Lys Trp Thr Gin Gly Thr Glu He 
50 55 6 o 

(2) INFORMATION FOR SEQ ID NO: 344: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 401 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO:344: 

Met Gly Lys Phe Arg Gin Ser Gin Pro Ser Val Ser Asn Asn Gly Leu 

1 5 10 15 

Ala Thr Val Val Pro Ser Val Arg Glu Gly Glu Pro Val Arg Val He 

20 25 30 

Ala Lys Ala Glu Asn Gly Asp He Leu Gly Glu Tyr Arg Leu His Phe 

35 40 45 

Thr Lys Asp Lys Asn Leu Leu Ser His Lys Pro Val Ala Ala Val Lys 

50 55 60 

Gin Ala Arg Leu Leu Gin Val Gly Gin Ala Leu Glu Leu Pro Thr Lys 
65 70 75 80 

Val Pro Val Tyr Phe Thr Gly Lys Asp Gly Tyr Glu Thr Lys Asp Leu 

85 9 0 95 

Ser Val Glu Trp Glu Glu Val Pro Ala Glu Asn Leu Thr Lys Ala Gly 

100 105 no 

Gin Phe Thr Val Arg Gly His Val Leu Gly Ser Asp Leu Val Ala Glu 

US 120 125 

Val Thr Val Arg Val Thr Asp Lys Leu Gly Glu Ala Leu Ser Asp Asn 

130 "5 140 

Pro Asn Tyr Asp Glu Asn Ser Asn Gin Ala Phe Ala Ser Ala Thr Asn 

145 iso 155 160 
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Asp He Asp Lys Asn Ser His Asp Arg Val Asp Tyr Leu Asn Asp Gly 
165 170 175 

Asp His Ser Glu Asn Arg Arg Trp Thr Asn Trp Ser Pro Thr Pro Ser 

180 185 190 

Ser Asn Pro Glu Val Ser Ala Gly Val He Phe Arg Glu Asn Gly Lys 

!95 200 205 

He Val Glu Arg Thr Val Ala Gin Ala Lys Leu His Phe Phe Ala Asp 

210 215 2 2o 

Ser Gly Thr Asp Ala Pro Thr Lys Leu Val Leu Glu Arg Tyr Val Gly 
225 230 235 240 

Pro Glu Phe Glu Val Pro Thr Tyr Tyr Ser Asn Tyr Gin Ala Tyr Asp 

2*5 250 255 

Ala Asp His Pro Phe Asn Asn Pro Glu Asn Trp Glu Ala Val Pro Tyr 

260 265 270 

Arg Ala Asp Lys Asp He Ala Ala Gly Asp Glu He Asn Val Thr Phe 

275 280 285 

Lys Ala Val Lys Ala Lys Ala Met Arg Trp Arg Met Glu Arg Lys Ala 

290 295 300 

Asp Lys Ser Gly Val Ala Met He Glu Met Thr Phe Leu Ala Pro Ser 
305 310 315 320 

Glu Leu Pro Gin Glu Ser Thr Gin Ser Lys He Leu Val Asp Gly Lys 

32 5 330 335 

Glu Leu Ala Asp Phe Ala Glu Asn Arg Gin Asp Tyr Gin He Thr Tyr 

340 345 350 

Lys Gly Gin Arg Pro Lys Val Ser Val Glu Glu Asn Asn Gin Val Ala 

355 3 6 o 365 

Ser Thr Val Val Asp Ser Gly Glu Asp Cys Leu Pro Val Leu Val Arg 

370 375 3 8 o 

Leu Val Ser Glu Ser Gly Lys Gin Val Lys Glu Tyr Arg Tyr Pro Val 
385 390 395 400 

Asp 



(2) INFORMATION FOR SEQ ID NO: 3 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 207 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 345: 

Met Asn He Gin Glu Glu He Lys Lys Arg Arg Thr Phe Ala lie lie 

15 10 15 

Ser His Pro Asp Ala Gly Lys Thr Thr He Thr Glu Gin Leu Leu Tyr 

20 25 30 

Phe Gly Gly Glu He Arg Glu Ala Gly Thr Val Lys Gly Lys Lys Thr 

35 40 45 

Gly Thr Phe Ala Lys Ser Asp Trp Met Asp He Glu Lys Gin Arg Gly 

50 55 60 

He Ser Val Thr Ser Ser Val Met Gin Phe Asp Tyr Asp Gly Lys Arg 
65 ™ 75 so 

Val Asn He Leu Asp Thr Pro Gly His Glu Asp Phe Ser Glu Asp Thr 

85 g 0 95 

Tyr Arg Thr Leu Met Ala Val Asp Ala Ala Val Met Val Val Asp Ser 

100 105 no 

Ala Lys Gly He Glu Ala Gin Thr Lys Lys Leu Phe Glu Val Val Lys 

115 120 125 

His Arg Gly He Pro Val Phe Thr Phe Met Asn Lys Leu Asp Arg Asp 

130 135 140 

Gly Arg Glu Pro Leu Asp Leu Leu Gin Glu Leu Glu Glu He Leu Gly 
145 150 155 16Q 

He Ala Ser Tyr Pro Met Asn Trp Pro He Gly Met Gly Lys Ala Phe 

165 170 175 

Glu Gly Leu Tyr Asp Leu Tyr Asn Gin Arg Leu Glu Leu Tyr Lys Gly 

"0 185 190 

Asp Glu Arg Phe Ala Ser Pro Arg Arg Trp Arg Gin Thr Phe Trp 
"5 200 205 

(2) INFORMATION FOR SEQ ID NO:346: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 284 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 46: 
Met Leu He Asn Ser Leu Asn Gin Ser Tyr He Ser Ala Lys Arg He 
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Glu Glu Val Phe Ala Glu Ser Pro Glu Asn lie His Ser Glu Leu Glu 

20 25 30 

Gin Lys Gin Val Thr Ser Gly Arg Val Leu Gin Val Gin Glu Leu Thr 

35 40 45 

Phe Thr Tyr Pro Asp Ala Ala Gin Pro Ser Leu Arg Asp lie Ser Phe 

50 55 60 

Asp Met Thr Gin Gly Gin lie Leu Gly He He Gly Gly Thr Gly Ser 
65 70 75 80 

Gly Lys Ser Ser Leu Val Gin Leu Leu Leu Gly Leu Tyr Pro Val Asp 

85 90 95 

Lys Gly Asn He Asp Leu Tyr Gin Asn Gly Arg Ser Pro Leu Asn Leu 

100 105 no 

Glu Gin Trp Arg Ser Trp He Ala Tyr Val Pro Gin Lys Val Lys Leu 

115 120 125 

Phe Lys Gly Thr He Arg Ser Asn Leu Thr Leu Gly Leu Asn Gin Glu 

130 135 140 

Val Ser Asp Gin Lys Leu Trp Gin Ala Leu Glu He Ala Gin Ala Lys 
145 150 155 16(J 

Asp Phe Val Ser Glu Lys Glu Gly Leu Leu Asp Ala Leu He Glu Ala 

165 170 175 

Gly Gly Arg Asn Phe Ser Gly Gly Gin Lys Gin Arg Leu Ser He Ala 

180 185 190 

Arg Ala Val Leu Arg Gin Ala Pro Phe He He Leu Asp Asp Ala Thr 

lg 5 200 205 

Ser Ala Leu Asp Thr He Thr Glu Ser Lys Leu Leu Lys Ala He Arg 

210 215 220 

Glu Asn Phe Pro Asn Thr Ser Leu He Leu He Ser Gin Arg Thr Ser 
225 230 235 240 

Thr Leu Gin Met Ala Asp Gin He Leu Leu Leu Glu Lys Gly Glu Leu 

245 250 255 

Leu Ala Val Gly Lys His Asp Asp Leu Met Lys Ser Ser Gin Val Tyr 

2fi 0 265 270 

Arg Glu He Asn Ala Ser Gin His Gly Lys Glu Asp 
275 280 



(2) INFORMATION FOR SEQ ID NO: 3 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 57 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: Protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 347: 

Met He Asp Asp Ala Gly Val Gly Val Lys He Leu Thr Ser Ser Lys 

Asp Ala Asn Asp Leu Leu Glu Lys Lys He Asp Gly Leu He Glu Lys 

20 25 30 

Phe Lys His Ala Tyr Ala Asn Val Lys He Glu Lys Leu Glu Thr He 

35 40 45 

Asn Ser Lys Glu He Glu Arg Lys Trp 
50 55 



(2) INFORMATION FOR SEQ ID NO: 348: 
* 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 348: 

Met Thr Gly Ala Gly Phe Gly Gly Cys Ala He Ala Leu Val Gin Lys 

15 10 15 

Asp Thr Val Glu Ala Phe Lys Glu Ala Val Gly Lys Leu Tyr Lys Glu 

20 25 30 

Val Val Gly Tyr Pro Xaa Ser Leu Leu Tyr Arg 
35 40 



(2) INFORMATION FOR SEQ ID NO: 3 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 349: 
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Met Ala Gin His Leu Thr Thr Glu Ala Leu Arg Lys Asp Phe Leu Ala 

1 5 10 is 

Val Phe Gly Gin Glu Ala Asp Gin Thr Phe Phe Ser Pro Gly Arg lie 

20 25 30 

Asn Leu lie Gly Glu His Thr Asp Tyr Asn Gly Gly His Val Phe Thr 

35 40 45 

Trp Cys Tyr Phe Leu Gly Asn Leu Arg Cys Ser Ser 



(2) INFORMATION FOR SEQ ID NO: 350: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 282 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 350: 

Met Thr Asp Arg Tyr Asn Tyr Glu Ala Val Gin Met Ala Phe Leu Pro 

1 5 10 15 

Thr Lys Gin Arg Ala Asn Met Gly Phe Gly He Cys Gly Phe Ala Asn 

20 25 30 

Thr Val Asp Thr Leu Ser Ala He Lys Tyr Ala Thr Val Lys Pro He 

35 40 45 

Arg Asp Glu Asp Gly Tyr He Tyr Asp Tyr Glu Thr He Gly Asp Tyr 

50 55 60 

Pro Arg Trp Gly Glu Asp Asp Pro Arg Ser Asn Glu Leu Ala Glu Trp 
65 70 75 80 

Leu He Glu Ala Tyr Thr Thr Arg Leu Arg Ser His Lys Leu Tyr Lys 

85 90 95 

Asp Ala Glu Ala Thr Val Ser Leu Leu Thr He Thr Ser Asn Val Ala 

100 105 HO 

Tyr Ser Lys Gin Thr Gly Asn Ser Pro Val His Lys Gly Val Tyr Leu 

115 120 125 

Asn Glu Asp Gly Ser Val Asn Leu Ser Lys Leu Glu Phe Phe Ser Pro 

130 135 140 

Gly Ala Asn Pro Ser Asn Lys Ala Lys Gly Gly Trp Leu Gin Asn Leu 
145 150 155 160 
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Asn Ser Leu Ser Ser Leu Asp Phe Ser Tyr Ala Ala Asp Gly He Ser 
165 170 175 

Leu Thr Thr Gin Val Ser Pro Arg Ala Leu Gly Lys Thr Arg Asp Glu 

180 185 190 

Gin Val Asp Asn Leu Val Thr lie Leu Asp Gly Tyr Phe Glu Asn Gly 

195 200 205 

Gly Gin His Val Asn Leu Asn Val Met Asp Leu Asn Asp Val Tyr Glu 

210 215 220 

Lys He Met Ser Gly Glu Asp Val He Val Arg He Ser Gly Tyr Cys 
225 230 235 240 

Val Asn Thr Lys Tyr Leu Thr Pro Glu Gin Lys Thr Glu Leu Thr Gin 

245 250 255 

Arg Val Phe His Glu Val Leu Ser Met Asp Asp Ala Le U Asp Ala Leu 

260 265 270 

Ser Ser Asn Pro Ala Ala Pro Gly Ser His 
275 280 

(2) INFORMATION FOR SEQ ID NO:351: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 71 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 351 : 

Met Met He Ala Leu Asp Val Val Leu Thr Pro He Phe Arg He Glu 

1 5 10 15 

Gly Met Ala Pro Met Ser Ser Val Val Asn He Leu Ala Gly He Met 

20 25 30 

Met Gly Pro Val Tyr Ala Leu Ala Met Ala Thr Val Thr Ala Phe He 

35 40 45 

Arg Met Thr Thr Gin Gly Tyr Ser Asp Lys Leu Leu Tyr Leu Ser Gin 

50 55 60 

Glu Arg Leu Leu Glu Pro Phe 
«* 70 

(2) INFORMATION FOR SEQ ID NO: 352: 



(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 103 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:352: 

Met Thr Ala Asn Glu Trp lie Trp Asp Lys Glu Ser Trp Phe Tyr Leu 

15 10 15 

Lys Ser Asp Gly Lys He Ala Glu Lys Glu Trp Val Tyr Asp Ser His 

20 25 30 

Ser Gin Ala Trp Tyr Tyr Phe Lys Ser Gly Gly Tyr Met Ala Lys Asn 

35 40 45 

Glu Thr Val Asp Gly Tyr Gin Leu Glu Ser Asp Gly Lys Trp Leu Gly 

50 55 60 

Gly Lys Ala Thr Asn Glu Asn Ala Ala Tyr Tyr Gin Val Val Pro Val 
65 70 75 80 

Thr Ala Asn Val Tyr Asp Ser Asp Gly Gly Lys Ala Phe Pro He Tyr 

85 90 95 

Arg Lys Val Val Ser Tyr Gly 
100 

(2) INFORMATION FOR SEQ ID NO: 353: 

U) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 167 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 353: 

Leu Asp His Thr He Phe Cys Val Gly Tyr Arg Val Met Gin Lys Asp 

1 5 10 15 

Leu Glu Gly Thr Leu Asp Ala Glu Lys Leu Lys Ala Ala Gly Val Pro 

20 25 30 

Phe Gly Pro Leu Phe Gly Lys He Lys Asn Gly Gin Asp Leu Val Leu 
35 40 45 
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Glu Asp Gly Thr Glu He Lys Ala Ala Asp Tyr He Ser Ala Pro Arg 
50 55 60 

Pro Gly Lys He He Thr He Leu Gly Asp Thr Arg Lys Thr Asp Ala 
65 70 75 80 

Ser Val Arg Leu Ala Val Asn Ala Asp Val Leu Val His Glu Ser Thr 

85 90 95 

Tyr Gly Lys Gly Asp Glu Lys He Ala Arg Asn His Gly His Ser Thr 

100 105 no 

Asn Met Gin Ala Ala Gin Val Ala Val Glu Ala Gly Ala Lys Arg Leu 

115 120 125 

Leu Leu Asn His He Ser Ala Arg Phe Leu Ser Lys Asp He Ser Lys 

130 135 140 

Leu Lys Lys Asp Ala Ala Thr He Phe Glu Asn Val His. Val Val Lys 
145 150 155 160 

Asp Leu Glu Glu Val Glu He 
165 



(2) INFORMATION FOR SEQ ID NO: 354: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 251 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:354: 

Met Pro Thr He Leu He Thr Gly Ala Ser Gly Gly Leu Ala Gin Glu 

1 5 io is 

Met Val Lys Leu Leu Pro Asn Asp Gin Leu He Leu Leu Gly Arg Asn 

20 25 30 

Lys Glu Lys Leu Ala Gin Leu Tyr Gly Asn Tyr Ser His Ala Glu Leu 

35 40 45 

He Glu He Asp He Thr Asp Asp Ser Ala Leu Glu Ala Leu Val Thr 

50 55 60 

Asp Leu Tyr Leu Arg Tyr Gly Lys He Asp Val Leu He Asn Asn Ala 
65 70 75 80 

Gly Tyr Gly He Phe Glu Gly Phe Asp Gin He Ala Asp Lys Asp He 

85 90 95 

His Gin Met Phe Glu Val Asn Thr Phe Ala Leu Met Asn Leu Ser Arg 
100 105 no 
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His Leu Ala Ala Arg Met Lys Glu Ser Ser Lys Gly His lie He Asn 

115 120 125 

He Val Ser Met Ala Gly Leu He Ala Thr Gly Lys Ser Ser Leu Tyr 

130 135 140 

Ser Ala Thr Lys Phe Ala Ala He Gly Phe Ser Asn Ala Leu Arg Leu 
145 150 155 160 

Glu Leu Met Pro Tyr Gly Val Tyr Val Thr Thr Val Asn Pro Gly Pro 

165 170 175 

He Arg Thr Gly Phe Phe Asp Gin Ala Asp Pro Asp Gly Thr Tyr Leu 

180 185 190 

Lys Ser Val Asp Arg Phe Leu Leu Glu Ala Asp Ala Val Ala Lys Lys 

195 200 205 

He Val Lys He He Gly Lys Asn Lys Arg Glu Leu Asn Leu Pro He 

210 215 220 

Leu Leu Asn Leu Ala His Lys Phe Tyr Thr Leu Phe Pro Lys Leu Ala 
225 230 235 240 

Asp Lys Leu Ala Gly Glu Thr Phe Asn Tyr Lys 



(2) INFORMATION FOR SEQ ID NO: 355: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 114 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:355: 

Met Ser Cys Leu Val Trp Asp Gin Ser Gin Gin Lys Asn Asn Ala Lys 

1 5 io 15 

Leu Gly He Glu Asn Leu Gin Asp Leu Leu Leu Tyr Phe Pro Phe Arg 

20 25 30 

Tyr Glu Asp Phe Lys Thr Lys Gin Val Le* Glu Leu Glu Asp Gly Glu 

35 40 45 

Lys Ala Val Leu Ser Gly Gin Val Val Thi to Ala Ser Val Gin Tyr 

50 55 60 

Tyr Gly Phe Lys Arg Asn Arg Leu Arg Phe Ser Leu Lys Gin Gly Glu 
65 70 75 80 

Val Val Phe Ala Val Asn Phe Phe Asn Gin Pro Tyr Leu Ala Asp Lys 
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lie Glu Leu Gly Ala Thr Leu Ala Val Phe Gly Lys Trp Asp Arg Ala 
100 105 no 

Lys Gly 



(2) INFORMATION FOR SEQ ID NO: 356: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 363 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 56: 

Met Lys Val Leu Ala Gin Val Glu Asp Asp Leu Gin Pro Val Tyr Arg 

15 10 15 

Leu Ala Gin Gly He Ser Gin Ala Ser Leu Val Lys Val He Lys Thr 

20 25 30 

Ala Phe Asp Gin Gly Leu Asp Leu Leu He Glu Glu Asn Leu Pro Gin 

35 40 45 

Ser Leu Leu Asp Lys Tyr Lys Leu Met Ser Arg Cys Gin Ala Val Arg 

50 55 60 

Ala Met His Phe Pro Lys Asp Leu Ala Glu Asn Lys Gin Ala Leu Arg 
65 70 75 80 

Arg lie Lys Phe Glu Glu Leu Phe Tyr Phe Gin Met Gin Leu Gin Thr 

85 90 95 

Leu Lys Ser Glu Asn Arg Val Gin Gly Ser Gly Leu Val Leu Asp Trp 

100 105 no 

Ser Gin Glu Lys Val Thr Ala Val Lys Ala Ser Leu Pro Phe Ala Leu 

115 120 125 

Thr Gin Ala Gin Glu Lys Ser Leu Gin Glu He Leu Thr Asp Met Lys 

130 135 140 

Ser Asp His His Met Asn Arg Leu Leu Gin Gly Asp Val Gly Ser Gly 



Lys Thr Val Val Ala Gly Leu Ala Met Phe Ala Ala Val Thr Ala Gly 

165 170 175 

Tyr Gin Ala Ala Leu Met Val Pro Thr Glu He Leu Ala Glu Gin His 

180 185 190 

Phe Glu Ser Leu Gin Asn Leu Phe Pro Asn Leu Lys Leu Ala Leu Leu 
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Thr Gly Ser Leu Lys Ala Ala Glu Lys Arg Glu Val Leu Glu Thr He 

210 215 220 

Ala Lys Gly Glu Ala Asp Leu He He Gly Thr His Ala Leu He Gin 
225 230 235 240 

Asp Gly Val Glu Tyr Ala Arg Leu Gly Leu He He He Asp Glu Gin 

245 250 255 

His Arg Phe Gly Val Gly Gin Arg Arg He Leu Arg Glu Lys Gly Asp 

260 265 270 

Asn Pro Asp Val Leu Met Met Thr Ala Thr Pro He Pro Arg Thr Leu 

275 280 285 

Ala He Thr Ala Phe Gly Asp Met Asp Val Ser He He Asp Gin Met 

290 295 300 

Pro Ala Gly Arg Lys Pro He Val Thr Arg Trp He Lys His Glu Gin 
305 310 315 320 

Leu Pro Gin Val Leu Thr Trp Leu Glu Gly Glu He Gin Lys Gly Ser 

325 330 335 

Gin Val Tyr Val He Ser Pro Leu He Glu Glu Ser Gin Ala Pro Arg 

340 345 350 

Phe Glu Lys Met Pro Leu Pro Tyr Gin Arg Ser 
355 360 



(2) INFORMATION FOR SEQ ID NO: 3 57: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 202 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 57: 



Met Lys Asp Leu Val Gin Lys Leu Ala Ser Lys Glu Glu Leu Ser Thr 

15 10 15 

Ala Asp Asn Glu Met He Glu Leu Leu Ala Arg Phe Asn Glu Arg Gin 

20 25 30 

Ala Ser Phe Phe Gly Gin Phe Ser Val Arg Gly Tyr Val Asn Tyr Asp 

35 40 45 

Lys His Val Ala Lys Tyr Leu Lys He Leu Pro Asp Gin Phe Ser Tyr 

50 55 60 

Gin Ala He Glu Asp Val Val Lys Ala Asp Ala Glu Lys Asn Thr Ser 
65 70 75 80 
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Asn Asn Glu Met Gly Met Glu Asn Tyr Phe Tyr Asn Glu Gin lie Lys 

85 90 95 

Lys Asp Leu Lys Lys Leu Lys Asp Ser Gin Lys Ser Phe Thr Tyr Leu 

100 105 HO 

Lys Ser Pro Glu Tyr Asn Asp Leu Gin Leu Val Leu Thr Gin Phe Ser 

115 120 125 

Lys Ser Lys Val Asn Pro lie Phe He He Pro Pro Val Asn Lys Lys 

130 135 140 

Trp Met Asp Tyr Ala Gly Leu Arg Glu Asp Met Tyr Gin Gin Thr Val 
145 150 155 160 

Gin Lys He Arg Tyr Gin Leu Glu Ser Gin Gly Phe Thr Asn He Ala 

165 170 175 

Asn Phe Ser Lys Asp Gly Gly Glu Pro Phe Phe Met Lys Asp Thr He 

180 185 190 

His Leu Gly Trp Leu Gly Leu Val Gly Phe 



(2) INFORMATION FOR SEQ ID NO: 358: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 112 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:358: 

Met Lys Gin Leu Ser Ser Ala Gin Val Arg Gin Met Trp Leu Asp Phe 

15 10 15 

Trp Ala Thr Lys Gly His Ser Val Glu Pro Ser Val Ser Leu Val Pro 

20 25 30 

Val Asn Asp Pro Thr Leu Leu Trp He Asn Ser Gly Val Ala Thr Leu 

35 40 45 

Lys Lys Tyr Phe Asp Gly Thr He He Pro Glu Asn Pro Arg He Thr 

50 55 60 

Asn Ala Gin Lys Ala He Arg Thr Asn Asp He Glu Asn Val Arg Glu 

65 70 75 80 

Arg Leu Arg Val Thr He Pro Cys Leu Lys Cys Trp Gly Thr Ser Leu 

85 90 95 

Ser Gly He Thr Ser Val Thr Lys Leu Ser Leu Gly Leu Met Ser Phe 
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(2) INFORMATION FOR SEQ ID NO:359: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 110 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:359: 

Met He Ala Leu Gly Trp Ser Asn He Gly Ala Ala He Ala Pro Asp 

15 XO is 

Ala Ala Leu Ala Ser Val Ala Ala Ala He He Met Val Leu Gly Gly 

20 25 30 

Asp Phe Thr Lys Thr Gly He Gly Val Ala Gin Ala Val Ala He Pro 

35 40 45 

Leu Ala Val Ala Gly Leu Phe Leu Thr Met He Val Arg Thr He Ser 

50 55 60 

Val Gly Leu Val His Thr Ala Asp Ala Ala Ala Lys Lys Gly Asp Phe 
65 70 75 80 

Gly Ala Val Glu Arg Ala His Phe lie Ala Leu Leu Phe Gin Gly Leu 

85 90 95 

Arg He Ala Leu Pro Ala Ala Leu Leu Pro Tyr Gly Thr Asn 



(2) INFORMATION FOR SEQ ID NO: 360: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 141 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 360 : 



Met Leu Pro Leu Lys Lys Val Thr Ser Ala Leu Trp Ser Val Arg He 
15 10 15 
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Ser Leu Arg Tyr Phe Ser Lys Asp Phe Val Ser Arg Phe Leu Gin Leu 
20 25 30 

Phe Ser Leu Met Val Pro Thr Glu Thr Val Gin Ser He Leu Ser Ala 

35 40 45 

Met Pro Asp Trp Leu Lys Asp Gly Met Ala He Gly Gly Gly Met Val 

50 55 60 

Val Ala Val Gly Tyr Ala Met Val He Asn Met Met Ala Thr Arg Glu 
65 70 75 bo 

Val Trp Pro Phe Phe Ala Leu Gly Phe Val Leu Ala Ala Val Ser Asp 

85 90 95 

He Thr Leu He Gly Phe Gly Ala He Gly Val Ala He Ala Leu He 

10 <> 105 no 

Tyr Leu His Leu Ser Lys Thr Gly Gly Asn Gly Gly Gly Gly Ala Ala 

120 125 
Thr Ser Asn Asp Pro He Gly Asp He Leu Glu Asp Tyr 
«° 135 140 

(2) INFORMATION FOR SEQ ID NO:361: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 86 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 361: 

Met Ala Lys Asn Gly Val Asp Gly Val Tyr Asn Ala Asp Pro Lys Lys 

1 5 io 15 

Asp Lys Thr Ala Val Lys Phe Glu Glu Leu Thr His Arg Asp Val He 

20 25 30 

Asn Lys Gly Leu Arg He Met Asp Ser Thr Ala Ser Thr Leu Ser Met 

35 40 45 

Asp Asn Asp He Asp Leu Val Val Phe Asn Met Asn Gin Ser Gly Asn 

50 55 60 

He Lys Arg Val Val Phe Gly Glu Asn He Gly Thr Thr Val Ser Asn 

65 7 ° 75 80 

Asn He Glu Glu Lys Glu 

85 



(2) INFORMATION FOR SEQ ID NO: 3 62: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 362: 



Leu lie Asn Leu Tyr Pro Asp Leu 

1 5 
Leu Gin Glu Pro Ala Ser Val Pro 
20 

Thr Glu Gly Asp Lys Gin Arg Leu 
35 40 

Tyr Glu 
50 



Asp Phe lie Val Thr Thr Val Ala 

10 is 
Phe Val Leu Val Ser Ala Phe Leu 
25 30 
Gin Ala Lys He Gin Glu He Asn 
45 



(2) INFORMATION FOR SEQ ID NO: 363: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 175 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 363: 



Met Ser Tyr Phe Lys Lys Tyr Lys 

1 5 
Gly Met Arg Thr Phe Lys Thr Gly 
20 

He Phe Gly Phe Phe Gly Trp Lys 

35 40 
Ala Val Phe Ser Leu Arg Glu Ser 

50 55 
Thr Ser Arg He Leu Gly Asn Ser 
65 70 



Phe Asp Lys Ser Gin Phe Lys Leu 

10 15 
He Ala Val Phe Leu Val Leu Leu 
25 30 
Gly Leu Gin He Gly Ala Leu Thr 
45 

Phe Asp Glu Ser Val His Phe Gly 
60 

He Gly Gly Leu Tyr Ala Leu Val 
75 80 
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Phe Phe Leu Leu Asn Thr Phe Phe His Glu Ala Phe Trp Val Thr Leu 

85 90 95 

Val Val Val Pro lie Cys Thr Met Leu Thr He Met Thr Asn Val Ala 

100 105 no 

Met Asn Asn Lys Ala Gly Val He Gly Gly Val Ala Ala Met Leu He 

115 120 125 

He Thr Leu Ser He Pro Ser Gly Glu Thr He Leu Tyr Val Phe Val 

130 135 140 

Arg Val Leu Glu Thr Phe Met Gly Val Phe Val Ala He He Val Asn 
145 150 155 160 

Tyr Asn He Asp Arg He Arg Leu Phe Leu Glu Lys Lys Glu Lys 
165 170 175 

(2) INFORMATION FOR SEQ ID NO: 364: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 69 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 364: 

Met lie Ala Lys Glu Phe Glu Thr Phe Leu Leu Gly Gin Glu Glu Thr 

1 5 10 15 

Phe Leu Thr Pro Val Lys Asn Leu Val Val Leu He Asp Thr His Asn 

20 25 30 

Ala Asp His Ala Thr Leu Leu Leu Ser Gin Met Thr Tyr Thr Arg Val 

35 40 45 

Pro Val Val Thr Asp Glu He Gin Leu Arg Trp Arg Gin Asp Val Leu 

50 55 60 

Tyr Gin Gin Val Leu 
65 

(2) INFORMATION FOR SEQ ID NO: 365 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 173 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 365: 

Met Ala Lys Gin Thr He He Val Met Ser Asp Ser His Gly Asp Ser 

15 10 15 

Leu He Val Glu Glu Val Arg Asp Arg Tyr Val Gly Lys Val Asp Ala 

20 25 3 0 

Val Phe His Asn Gly Asp Ser Glu Leu Arg Pro Asp Ser Pro Leu Trp 

35 40 45 

Glu Gly He Arg Val Val Lys Gly Asn Met Asp Phe Tyr Ala Gly Tyr 

50 55 60 

Pro Glu Arg Leu Val Thr Glu Leu Gly Ser Thr Lys He He Gin Thr 
65 70 75 80 

His Gly His Leu Phe Asp He Asn Phe Asn Phe Gin Lys Leu Asp Tyr 

85 90 95 

Trp Ala Gin Glu Glu Glu Ala Ala He Cys Leu Tyr Gly His Leu His 

"0 105 no 

Val Pro Ser Ala Trp Leu Glu Gly Lys He Leu Phe Leu Asn Pro Gly 

115 120 125 

Ser He Ser Gin Pro Arg Gly Thr He Arg Glu Cys Leu Tyr Ala Arg 

130 135 140 

Val Glu He Asp Asp Ser Tyr Phe Lys Val Asp Phe Leu Thr Arg Asp 
145 15° 155 160 

His Glu Val Tyr Pro Gly Leu Ser Lys Glu Phe Ser Arg 
165 170 

(2) INFORMATION FOR SEQ ID NO: 366: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 456 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 366: 

Met Ser Thr Thr He He Gly Phe Pro Arg Leu Gly Glu Phe Arg Glu 
1 5 io 15 
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Leu Lys Phe Thr Thr Glu Lys Tyr Phe Arg Lys Glu He Ser Glu Glu 

20 25 30 

Glu Leu Leu Ala Ala Ala Lys Asp Leu Arg Gly Lys His Trp Asn He 

35 40 45 

Val Lys Gly Lys Gly He Thr Glu He Pro Ser Asn Asp Phe Ser His 

50 55 60 

Tyr Asp Asn Phe Leu Asp Ala Ala Phe Leu Phe Asn Val Val Pro Ala 
65 70 75 80 

Ser Val Gin Asn Leu Asp Leu Ser Asp Leu Glu Arg Tyr Phe Ala Leu 

85 90 95 

Gly Arg Gly Tyr Gin Gly Gly Lys Gly Asp Val Arg Ala Leu Pro Met 

100 105 no 

Lys Lys Trp Phe Asn Thr Asn Tyr His Tyr He Val Pro Lys Phe Glu 

115 120 125 

Lys Asp Thr Gin Val Lys Leu Ala Gly His Lys lie Phe Asp Glu Phe 

130 135 140 

Gin Glu Ala Lys Glu Leu Gly Leu Asn Thr Arg Pro Val Leu Val Gly 
145 150 155 160 

Pro Phe Thr Phe Leu Gin Leu Ser Asp Phe Glu Glu Gly Val Lys Ala 

165 170 175 

Asp Asp Phe Val Asp Ser Leu Val Ala Ala Tyr Gin Glu Val Phe Ala 

180 185 190 

Lys Leu Ala Glu Leu Gly Ala Thr Arg He Gin Leu Asp Glu Ala Ala 

195 200 205 

Leu Val Lys Asp Leu Thr Ala Glu Glu Lys Ala Leu Phe Leu Asn Leu 

210 215 220 

Tyr Asn Lys Leu Leu Ala Asp Lys Lys Gly Leu Glu Val Leu Leu Gin 
225 230 235 240 

Thr Tyr Phe Gly Asp Val Arg Asp Val Tyr Ala Asp Leu Val Asn Leu 

245 250 255 

Pro Val Asp Ala lie Gly Leu Asp Phe Val Glu Gly Lys Lys Thr Leu 

260 265 270 

Glu Leu Val Lys Gly Gly Phe Pro Ala Asp Lys Thr Leu Tyr Val Gly 

275 280 285 

He Val Asn Gly Lys Asn He Trp Arg Asn Asn Tyr Glu Lys Ser Leu 

290 295 300 

Ala Val Leu Glu Gin He Pro Ala Glu Asn lie Val Leu Thr Ser Ser 
305 310 315 320 

Cys Ser Leu Leu His Val Pro Phe Thr Thr Ala Asn Glu Glu Phe Glu 

325 330 335 

Pro Ala Leu Leu Asn His Phe Ala Phe Ala Val Glu Lys Leu Asp Glu 
340 345 350 
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lie Arg Asp Leu Asp Ala He Arg Asn Gly Gin Gly Ser Glu Ala Leu 

355 360 365 

Ala Ala Asn Lys Glu Leu Phe Ala Thr Glu Arg Val Gly Glu Asn Ala 

3? 0 375 380 

Glu Leu Arg Ala Arg He Ala Gly Leu Thr Asp Ala Asp Tyr Thr Arg 
385 390 395 400 

Leu Pro Ala Phe Ala Glu Arg Glu Ala lie Gin Gly Glu Gly Phe Lys 

405 410 415 

Phe Pro Ala Phe Pro Thr Thr Arg He Gly Ser Phe Pro Gin Thr Lys 

«0 425 430 

Glu Val Arg Ala Lys Arg Leu Ala Tyr Arg Lys Gly Glu Leu Ser Gin 

435 440 445 

Lys Glu Tyr Asp Ala Phe Pro Cys 
450 455 



(2) INFORMATION FOR SEQ ID NO: 367 : 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 160 amino acids 

(B) TYPE: amino acid 

{C) STRAND EDNESS : single 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 367: 



Met Leu Thr Gly He Leu Lys Pro Thr Ser Gly Phe Cys Arg He Asn 

15 10 15 

Gly Lys He Pro Gin Glu Asn Arg Gin Asp Tyr Val Lys Asp He Gly 

20 25 30 

Val Val Phe Gly Gin Arg Thr Gin Leu Trp Trp Asp Leu Ala Leu Gin 

35 40 45 

Glu Thr Tyr Ser Val Leu Lys Glu He Tyr Asp Val Pro Asp Ala Val 

50 55 60 

Phe Gin Lys Arg Met Asp Phe Leu Asn Asp Val Leu Asp Leu Lys Glu 
65 70 75 80 

Phe He Lys Asp Pro Val Arg Thr Leu Ser Leu Gly Gin Arg Met Arg 

85 90 95 

Ala Asp He Ala Ala Ser Leu Leu His Asn Pro Lys Val Leu Phe Leu 

100 105 no 

Asp Glu Pro Thr He Gly Leu Asp Val Ser Val Lys Asp Asn He Arg 
H5 120 125 
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Arg Ala He Thr Gin He Asn Gin Glu Glu Glu Thr Thr He Leu Leu 

130 135 140 

Thr Thr His Asp Leu Ser Asp He Glu Gin Phe Val He Gly Phe Ser 
145 150 155 160 

(2) INFORMATION FOR SEQ ID NO:368: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 73 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 368 : 

Met Lys val He Asn Gin Thr Leu Leu Glu Lys Val He He Glu Arg 

1 5 io 15 

Ser Arg Ser Ser His Lys Gly Asp Tyr Gly Arg Leu Leu Leu Leu Gly 

20 25 30 

Gly Thr Tyr Pro Tyr Gly Gly Ala He He Met Ala Ala Leu Ala Ala 

35 40 45 

Val Lys Ser Gly Ala Gly Leu Val Thr Val Gly Thr Asp Arg Glu Lys 

50 55 60 

Tyr Pro Gly Ser Ala Gin Pro Phe Thr 
65 70 



(2) INFORMATION FOR SEQ ID NO: 3 69: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:369: 

Met Xaa Val He Asn Gin Ser Leu Leu Xaa Xaa Val He He Asp Arg 
1 5 10 15 
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Pro Arg Ser Ser Leu Lys Gly Asp Tyr Gly Arg Leu Pro Xaa Leu Gly 

20 25 30 

Gly Xaa Tyr Gly Gly Gly Gly Xaa Thr Ser Xaa Ala Ala Xaa Xaa Gly 

. 35 40 45 

Xaa Xaa Xaa Xaa Ala 
50 

(2) INFORMATION FOR SEQ ID NO: 370: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 178 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 370: 

Met He Asn Arg Tyr Ser Arg Pro Glu Met Ala Asn He Trp Ser Glu 

15 10 15 

Glu Asn Lys Tyr Arg Ala Trp Leu Glu Val Glu He Leu Ser Asp Glu 

20 25 30 

Ala Trp Ala Glu Leu Gly Glu He Pro Lys Glu Asp Val Ala Leu He 

35 40 45 

Arg Lys Lys Ala Asp Phe Asp He Asp Arg He Leu Glu He Glu Gin 

50 55 60 

Glu Thr Arg His Asp Val Val Ala Phe Thr Arg Ala Val Ser Glu Thr 
65 70 75 80 

Leu Gly Glu Glu Arg Lys Trp Val His Tyr Gly Leu Thr Ser Thr Asp 

85 90 95 

Val Val Asp Thr Ala Tyr Gly Tyr Leu Tyr Lys Gin Ala Asn Asp He 

100 105 no 

He Arg Arg Asp Leu Glu Asn Phe Thr Asn He He Ala Asp Lys Ala 

115 120 125 

Lys Glu His Lys Phe Thr He Met Met Gly Arg Thr His Gly Val His 

130 135 140 

Ala Glu Pro Thr Thr Phe Gly Leu Lys Leu Ala Thr Trp Tyr Ser Glu 
145 150 155 160 

Met Lys Arg Asn lie Gly Ala Leu Arg Ala Cys Gly Cys Trp Cys Arg 
165 170 175 

Ser Trp 
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(2) INFORMATION FOR SEQ ID NO: 371: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 371: 

Met Lys Thr Asn Asp He Val Tyr Gly Val His Ala Val Thr Glu Ala 

15 10 15 

Leu Leu Ala Asn Thr Gly Asn Lys Leu Tyr Leu Gin Glu Asp Leu Arg 

20 25 30 

Gly Lys Asn Val Glu Lys Val Lys Glu Leu Ala Val Thr Phe 
35 40 45 

(2) INFORMATION FOR SEQ ID NO: 372: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 221 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 372: 

Met Leu Leu Ser Ala Phe His Lys Tyr Glu Asn Gin Leu Asp Lys Val 

1 5 10 15 

Arg Phe He Gly Leu His Thr Gly His Leu Gly Phe Tyr Thr Asp Tyr 

20 25 30 

Arg Asp Phe Glu Leu Asp Lys Leu Val Thr Asn Leu Gin Leu Asp Thr 

35 40 45 

Gly Ala Arg Val Ser Tyr Pro Val Leu Asn Val Lys Val Phe Leu Glu 

50 55 60 

Asn Gly Glu Val Lys He Phe Arg Ala Leu Asn Glu Ala Ser He Arg 
65 7 0 75 80 
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Arg Ser Asp Arg Thr Met Val Ala Asp He Val He Asn Gly Val Pro 
85 90 95 

Phe Glu Arg Phe Arg Gly Asp Gly Leu Thr Val Ser Thr Pro Thr Gly 

100 105 no 

Ser Thr Ala Tyr Asn Lys Ser Leu Gly Gly Ala Val Leu His Pro Thr 

115 120 125 

He Glu Ala Leu Gin Leu Thr Glu lie Ala Ser Leu Asn Asn Arg Val 

130 135 140 

Tyr Arg Thr Leu Gly Ser Ser He He Val Pro Lys Lys Asp Lys He 
145 "0 155 i 6 o 

Glu Leu He Pro Thr Arg Asn Asp Tyr His Thr He Ser Val Asp Asn 

165 170 175 

Ser Val Tyr Ser Phe Arg Asn He Glu Arg lie Glu Tyr Gin He Asp 

18° 185 190 

His His Lys He His Phe Val Ala Thr Pro Ser His Thr Ser Phe Trp 

195 200 205 

Asn Arg Val Lys Asp Ala Phe He Gly Glu Val Asp Glu 
210 215 220 



(2) INFORMATION FOR SEQ ID NO: 373: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 137 amino acids 
<B) TYPE: amino acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 373: 



Met Leu Val Pro Lys Arg Val Lys His Arg Arg Glu Phe Arg Gly Lys 

1 5 io is 

Met Arg Gly Glu Ala Lys Gly Gly Lys Glu Val Ala Phe Gly Glu Tyr 

20 25 30 

Gly Leu Gin Ala Thr Thr Ser His Trp lie Thr Asn Arg Gin He Glu 

35 40 45 

Ala Ala Arg He Ala Met Thr Arg Tyr Met Lys Arg Gly Gly Lys Val 

50 55 go 

Trp He Lys He Phe Pro His Lys Ser Tyr Thr Ala Lys Ala He Gly 
65 ™ 75 80 

Val Arg Met Gly Ser Gly Lys Gly Ala Pro Glu Gly Trp Val Ala Pro 
85 go 95 
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Val Lys Arg Gly Lys Val Met Phe Glu He Ala Gly Val Ser Glu Glu 

100 105 110 

He Ala Arg Glu Ala Leu Arg Leu Ala Ser His Lys Leu Pro Val Lys 

115 120 125 

Cys Lys Phe Val Lys Arg Glu Ala Glu 
130 135 



(2) INFORMATION FOR SEQ ID NO: 374: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 221 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 374: 



Met Tyr Thr Val Ser Gly He Ala Tyr Arg Trp Phe Ser Thr He Leu 

1 5 10 is 

Ala Leu Val Ser Ala Thr Ser Val Phe Val Leu Thr Thr He Ser Leu 

20 25 30 

He Gly Val Glu Thr Phe Ser Lys He Pro Val Phe Gly Tyr Val Ala 

35 40 45 

Lys Arg Phe Ser Ala Phe Phe Asn Pro Phe Ala Asp Arg Ala Asp Ala 

50 55 60 

Gly His Gin Leu Ala Asn Ser Tyr Phe Ala Met Val Asn Gly Gly Trp 
65 70 75 80 

Phe Gly Leu Gly Leu Gly Asn Ser He Glu Lys Arg Gly Tyr Leu Pro 

85 90 95 

Glu Ala His Thr Asp Phe Val Phe Ser He Val He Glu Glu Phe Gly 

100 105 no 

Phe Val Gly Ala Ser Leu He Leu Ala Leu Leu Phe Phe Met He Leu 

H* 120 125 

Arg He He Leu Val Gly He Arg Ala Glu Asn Pro Phe Asn Ala Met 

130 135 140 

Val Ala Leu Gly Val Gly Gly Met Met Leu Val Gin Val Phe Val Asn 
145 150 155 160 

He Gly Gly He Ser Gly Leu He Pro Ser Thr Gly Val Thr Phe Pro 

165 170 175 

Phe Leu Ser Gin Gly Gly Asn Ser Leu Leu Val Leu Ser Val Ala Val 
180 185 190 
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Ala Phe Val Leu Asn lie Asp Ala Ser Glu Lys Arg Ala Lys Leu Tyr 

195 200 205 

Arg Glu Leu Glu Asn Gin Pro Met Asn Leu Leu Leu Lys 
210 215 220 



(2) INFORMATION FOR SEQ ID NO: 375: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 111 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 375: 



Met Leu Leu lie Val lie lie He Gly Thr Phe Tyr Phe Phe Asn Arg 

15 10 is 

Ala Arg Tyr Leu Gly Val Thr Tyr Tyr Ser Arg Phe His Phe Thr He 

20 25 30 

Leu Gly Cys Phe Phe Leu Thr Leu Ala He Thr Ala Leu Leu Met Leu 

35 40 45 

Gin Asn Tyr Gin Phe Asn He Glu He Tyr Gin His Asn Pro Leu Asn 

50 55 60 

Phe Lys Tyr Leu Ser Ala Trp Val He Thr Tyr Val He Tyr Leu Pro 
65 70 75 80 

Trp Val Phe He Gly Asn Leu Gly Leu Lys Ser Tyr Gly Glu Trp Ala 

85 90 95 

Gin Lys Lys Phe Glu Gin Asp Met Asp Glu Leu Glu Ser Gly Glu 
100 105 no 



(2) INFORMATION FOR SEQ ID NO: 376: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 68 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37 6: 
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Met Leu Asp Val Glu Ala lie Arg 

1 5 
He Val Asn Asp Glu Pro Leu Val 
20 

Gin Lys Pro Leu Val Val Leu Lys 

35 40 
Asp Asn Ala Asn Val His Arg Gly 

50 55 
Thr Ala Ser Leu 
65 



Lys Asp Phe Pro He Leu Asp Gin 

10 15 
Tyr Leu Asp Asn Ala Ala Thr Thr 
25 30 
Ala He Asn Ser Tyr Tyr Glu Gin 
45 

Val His Thr Leu Ala Glu Arg Ala 
60 



(2) INFORMATION FOR SEQ ID NO: 377 : 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 174 amino acids 
<B) TYPE: amino acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 377: 

Met Gly Phe Asn Gin Pro Leu Ser Trp Leu He Leu His Ala Lys Gly 

15 10 15 

Leu Thr Glu Ser Phe He Lys Ala Ser Ala Gin Thr Gly Ala Asp Arg 

20 25 30 

He Thr Tyr He Ser Cys Asn Val Arg Asn His Gly Ala Val Asp He 

35 40 45 

Lys Leu Tyr Gin Glu Leu Gly Tyr Glu Leu Lys Lys Val Gin Pro Val 

50 55 60 

Asp Leu Phe Xaa Gin Thr His His Val Glu Thr Val Ala Leu Leu Ser 
65 70 75 80 

Lys Leu Asp Val Asp Lys His He Ser Val Glu He Glu Leu Asp Glu 

85 90 95 

Met Asp Leu Thr Ser Ala Glu Ser Lys Ala Thr Tyr Ala Gin He Lys 

100 105 no 

Glu Tyr Val Trp Asn Lys Phe Glu Leu Lys Val Ser Thr Leu Tyr He 

115 120 125 

Ala Gin He Lys Lys Lys Cys Gly He Glu Leu Arg Glu His Tyr Asn 
130 135 140 
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Lys Xaa Lys Lys Asp Lys Gin He He Pro Gin Cys Thr Pro Glu Lys 
145 150 155 160 

Glu Glu Ala He Met Asp Ala Leu Arg Xaa Phe Lys Met He 
165 170 



(2) INFORMATION FOR SEQ ID NO: 378: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 104 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:378: 

Leu Thr Glu Ala Gly Tyr Val Gly Glu Asp Val Glu Asn He Leu Leu 

15 10 15 

Lys Leu Leu Gin Val Ala Asp Phe Asn He Glu Arg Ala Glu Arg Gly 

20 25 30 

He He Tyr Val Asp Glu He Asp Lys He Ala Lys Lys Ser Glu Asn 

35 40 45 

Val Ser He Thr Arg Asp Val Ser Gly Glu Gly Val Gin Gin Ala Leu 

50 55 60 

Leu Lys He lie Glu Gly Thr Val Ala Ser Val Pro Pro Gin Gly Gly 
65 7 0 75 80 

Arg Lys His Pro Gin Gin Val Asp Asp Ser Ser Gly Tyr Lys Lys Tyr 

85 90 95 

Pro Leu His Arg Gly Trp Cys Phe 
100 



(2) INFORMATION FOR SEQ ID NO:379: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 128 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 379: 
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Met Gin Glu He He Ala Glu Asp He Gin Lys Phe Gly He He Pro 

15 10 15 

Glu Leu He Gly Arg Leu Pro Val Phe Ala Ala Leu Glu Gin Leu Thr 

20 25 30 

Val Asp Asp Leu Val Arg He Leu Lys Glu Pro Arg Asn Ala Leu Val 

35 40 45 

Lys Gin Tyr Gin Thr Leu Leu Ser Tyr Asp Asp Val Glu Leu Glu Phe 

50 55 60 

Asp Asp Glu Ala Leu Gin Glu He Ala Asn Lys Ala lie Glu Arg Lys 
65 70 75 80 

Thr Gly Ala Arg Gly Leu Arg Ser lie lie Glu Glu Thr Met Leu Asp 

85 90 95 

Val Met Phe Glu Val Pro Ser Gin Glu Asn Val Lys Leu Val Arg He 

100 105 no 

Thr Lys Glu Thr Val Asp Gly Thr Asp Lys Pro He Leu Glu Thr Ala 
H5 120 125 

(2) INFORMATION FOR SEQ ID NO: 3 80: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 124 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:380: 

Met Glu Gly Gin Ser Ala Met Gin Ser Gin Glu Gin Asp Ser Asn Gly 

1 5 10 15 

Ala Val Lys Ser lie Val Ser Lys Ser Pro Gly Ala lie Ser Tyr Leu 

20 25 30 

Ser Leu Thr Tyr lie Asp Asp Ser Val Lys Ser Met Lys Leu Asn Gly 

35 40 45 

Tyr Asp Leu Ser Pro Glu Asn lie Ser Ser Asn Asn Trp Pro Leu Trp 

50 55 go 

Ser Tyr Glu His Met Tyr Thr Leu Gly Gin Pro Asn Glu Leu Ala Ala 
65 7 ° 75 so 

Glu Phe Leu Asn Phe Val Leu Ser Asp Glu Thr Gin Glu Gly lie Val 
85 90 95 
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Lys Gly Leu Lys Tyr lie Pro ile Lys Glu Met Lys Val Glu Lys Asp 
100 105 no 

Ala Ala Gly Thr Val Thr Val Leu Glu Gly Arg Gin 



(2) INFORMATION FOR SEQ ID NO: 381: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 191 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 81: 

Met Glu Ile Lys Ile Glu Thr Gly Gly Leu Arg Leu Asp Lys Ala Leu 

15 10 15 

Ser Asp Leu Ser Glu Leu Ser Arg Ser Leu Ala Asn Glu Gin Ile Lys 

20 25 30 

Ser Gly Gin Val Leu Val Asn Gly Gin Val Lys Lys Ala Lys Tyr Thr 

35 40 45 

Val Gin Glu Gly Asp Val Val Thr Tyr His Val Pro Glu Pro Glu Val 

50 55 60 

Leu Glu Tyr Val Ala Glu Asp Leu Pro Leu Glu Ile Val Tyr Gin Asp 
65 70 75 80 

Glu Asp Val Ala Val Val Asn Lys Pro Gin Gly Met Val Val His Pro 

85 90 95 

Ser Ala Gly His Thr Ser Gly Thr Leu Val Asn Ala Leu Met Tyr His 

100 105 no 

Ile Lys Asp Leu Ser Gly Ile Asn Gly Val Leu Arg Pro Gly Ile Val 

115 120 125 

His Arg Ile Asp Lys Asp Thr Ser Gly Leu Leu Met lie Ala Lys Asn 

130 135 140 

Asp Asp Ala His Leu Ala Leu Ala Gin Glu Leu Lys Asp Lys Lys Ser 
145 150 155 160 

Leu Pro Gin Ile Leu Gly Asp Cys Ser Met Glu Ile Cys Leu Met Ile 

165 170 175 

Val Val Leu Ile Glu Thr Pro Glu Leu Ala Asn Glu Lys Glu Pro 



115 



120 



180 



185 



190 



(2) INFORMATION FOR SEQ ID NO: 382: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 157 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 382: 

Met Lys He Leu Val Thr Gly Phe Asn Pro Phe Gly Gly Glu Lys lie 

15 10 15 

Asn Pro Ala Leu Glu Ala Val Lys Leu Leu Pro Ser Glu He Asn Gly 

20 25 30 

Ala Glu Val Arg Trp Val Glu He Pro Thr Val Phe Tyr Lys Ser Ser 

35 40 45 

Glu Val Leu Glu Ala Glu He Leu Arg Tyr Gin Pro Asp Ala Val Leu 

50 55 60 

Cys He Gly Gin Ala Gly Gly Arg Thr Gly Leu Thr Pro Glu Arg Val 
65 70 75 80 

Ala He Asn Gin Asp Asp Ala Arg He Pro Asp Asn Glu Gly Asn Gin 

85 90 95 

Pro He Asp Thr Pro lie Arg He Asp Gly Ala Ser Ala Tyr Phe Ser 

100 105 no 

Ser Leu Pro He Lys Ala Met Val Gin Ala He Lys Lys Gin Gly Leu 

115 120 125 

Pro Ala Val Val Ser Asn Ser Ala Gly Thr Phe Val Cys Asn His Leu 

13 0 135 140 

Met Tyr Gin Ala Leu Tyr Leu Val Asp Lys Glu He Pro 
145 150 155 



(2) INFORMATION FOR SEQ ID NO: 383: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:383: 
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Met Leu Arg Ala Gly Phe His Ala Tyr Ser lie Tyr Asp Gly Thr Ser 

15 10 15 

Ser Glu Gin Lys Pro Asn Thr Pro Asp Tyr Glu Phe Tyr Val Asp He 

20 25 30 

Arg Arg Lys Gly He Glu Ala Ala lie Gly Ala Met He Glu His Gly 

35 40 45 

Asp Gin Glu Leu Lys Leu Val Gly Gly Glu He His 



(2) INFORMATION FOR SEQ ID NO: 384: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 71 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 384: 

Met Lys Phe Arg Lys Leu Ala Cys Thr Val Leu Ala Gly Ala Ala Val 

1 5 10 15 

Leu Gly Leu Ala Ala Cys Gly Asn Ser Gly Gly Ser Lys Asp Ala Ala 

20 25 30 

Lys Ser Gly Gly Asp Gly Ala Lys Thr Glu He Thr Trp Trp Ala Phe 

35 40 45 

Pro Val Phe Thr Gin Glu Lys Thr Gly Asp Gly Val Gly Thr Tyr Glu 

50 55 60 

Lys Ser He He Glu Ala Phe 
65 70 

{2) INFORMATION FOR SEQ ID NO: 385: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 58 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 



50 



55 



60 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 385: 

Met Ala Leu Gly Thr Thr Leu Thr Val Gin Ala He Cys Leu Gin Thr 

15 10 15 

Ala Gly Gly Ser Thr Gin Thr Ala Thr Gly Thr Gly Ser Thr Thr Pro 

20 25 30 

Gly Glu Met Ala Thr Gly Tip Lys Ser Pro Thr Leu Leu Phe Phe Asn 

35 40 45 

Thr Ser Tyr Phe Asp Leu Thr Gly Trp Ser 
50 55 

(2) INFORMATION FOR SEQ ID NO:386: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 209 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 86: 

Met Glu Ala Glu Leu Lys Lys Leu Glu Val Asn Leu Val Asn Leu Ala 

15 10 15 

Gly Gin Leu Ala Ser Ala Arg His Asn Leu Ala Gin Gin Leu Glu Ala 

20 25 30 

Glu He Lys Gin Glu Leu Gin Asp Leu Tyr Met Glu Lys Ala Gin Phe 

35 40 45 

Gin Val Arg Phe Ser Lys Gly Lys Phe Ser Arg Glu Gly Asn Lys Met 

50 55 60 

Val Glu Phe Tyr He Ser Thr Asn Pro Gly Glu Asp Phe Lys Pro Leu 
65 70 75 80 

Val Lys Val Ala Ser Gly Gly Glu Leu Ser Arg Leu Met Leu Ala He 

85 so 95 

Lys Ser Ala Phe Ser Arg Lys Glu Gly Lys Thr Ser He Val Phe Asp 

100 105 no 

Glu Val Glu Thr Gly Val Ser Gly Arg Val Ala Gin Ala He Ala Gin 

115 120 125 

Lys He His Lys He Gly Gin His Gly Gin Val Leu Ala He Ser His 

130 135 140 

Leu Pro Gin Val He Ala lie Ala Asp Tyr Gin Phe Phe He Glu Lys 



145 



150 



155 



160 
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lie Ser Asn Asp His Ser Thr Val Ser Thr Val Arg Leu Leu Thr Val 

165 170 175 

Glu Glu Arg Val Glu Glu Val Ala Lys Met Leu Ala Gly Asp Asp Val 

180 185 190 

Thr Glu Ala Ala Leu Thr Gin Ala Arg Glu Leu Leu Arg Asn Arg Glu 
195 200 205 

Lys 



(2) INFORMATION FOR SEQ ID NO: 387: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 131 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii> MOLECULE TYPE: Protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:387: 

Met Met Gly Glu Asn Leu Phe Arg Ser He Lys He Arg Gly Glu Phe 

15 10 15 

Leu His Met His Val Arg Met He Pro Lys Ser Thr Pro Asp Thr Lys 

20 25 30 

Phe Ala Asp Val Ala Thr His Gin Pro Glu Tyr Ser Arg Asp Asn Val 

35 40 45 

Ala Gly Thr He Val Gly Phe Trp Thr Pro Glu He Phe His Gly Val 

50 55 60 

Ser Val Ala Gly Tyr His Leu His Phe He Ser Asp Asp Leu Thr Phe 
65 70 75 80 

Gly Gly His Val Met Asp Phe Val He Lys Glu Gly He He Glu Val 

85 90 95 

Gly Ala Val Asp Gin Leu Asp Gin Arg Phe Pro Val Gin Asp Arg Gin 

100 105 no 

Tyr Leu Phe Ala Lys Phe Asn Val Asp Glu Met Lys Lys Asp He Glu 

115 120 125 

Lys Ala Glu 



130 



(2) INFORMATION FOR SEQ ID NO: 388 : 



(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 119 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 388: 

Met Ala Gly Ser Ser Arg Asp His Ala Ala Trp Ala Leu Ala Asp Tyr 

1 5 10 15 

Gly Phe Lys Val Val He Ala Gly Ser Phe Gly Asp He His Tyr Asn 

20 25 30 

Asn Glu Leu Asn Asn Gly Met Leu Pro He Val Gin Pro Arg Glu Val 

35 40 45 

Arg Glu Lys Leu Ala Gin Leu Lys Pro Thr Asp Gin Val Thr Val Asp 

50 55 60 

Leu Glu Gin Gin Lys He He Ser Pro Val Glu Glu Phe Thr Phe Glu 
65 70 75 80 

He Asp Ser Lys Trp Lys His Lys Leu Leu Asn Ser Leu Asp Asp He 

85 90 95 

Gly He Thr Leu Gin Tyr Glu Glu Leu He Ala Ala Tyr Glu Lys Gin 

100 105 110 

Arg Pro Ala Tyr Trp Gin Asp 
115 

(2) INFORMATION FOR SEQ ID NO: 389: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 359 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:389: 

Met Asn He Tyr Asp Gin Leu Gin Ala Val Glu Asp Arg Tyr Glu Glu 

1 5 10 is 

Leu Gly Glu Leu Leu Ser Asp Pro Asp Val Val Ser Asp Thr Lys Arg 
20 25 30 
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Phe Met Glu Leu Ser Lys Glu Glu Ala Ser Asn Arg Asp Thr 



35 



Val He 



40 



45 



Ala Tyr Arg Glu Tyr Lys Gin Val Leu Gin Asn lie Val Asp Ala Glu 

50 55 60 

Glu Met lie Lys Glu Ser Gly Gly Asp Ala Asp Leu Glu Glu Leu Ala 
65 70 75 80 

Lys Gin Glu Leu Lys Asp Ala Lys Ala Glu Lys Glu Glu Tyr Glu Glu 

85 *° 95 

Lys Leu Lys lie Leu Leu Leu Pro Lys Asp Pro Asn Asp Asp Lys Asn 



100 



105 



110 



lie lie Leu Glu He Arg Gly Ala Ala Gly Gly Asp Glu Ala Ala 



115 



Leu 



120 



125 



Phe Ala Gly Asp Leu Leu Thr Met Tyr Gin Lys Tyr Ala Glu Ala Gin 

130 "5 140 

Gly Trp Arg Phe Glu Val Met Glu Ala Ser Met Asn Gly Val Gly Gly 

145 150 1EC 

155 160 
Phe Lys Glu Val Val Ala Met Val Ser Gly Gin Ser Val Tyr Ser Lys 

165 170 175 

Leu Lys Tyr Glu Ser Gly Ala His Arg Val Gin Arg Val Pro Val Thr 

180 «5 190 

Glu ser Gin Gly Arg Val His Thr Ser Thr Ala Thr Val Leu Val Met 

195 200 205 

Pro Glu Val Glu Glu Val Glu Tyr Asp He Asp Pro Lys Asp Leu Arg 



210 



215 



220 



Val Asp lie Tyr His Ala Ser Gly Ala Gly Gly Gin Asn Val Asn Lys 
225 230 235 24 o 

Val Ala Thr Ala Val Arg lie Val His Leu Pro Thr Asn lie Lys Val 

245 250 255 

Glu Met Gin Glu Glu Arg Thr Gin Gin Lys Asn Arg Glu Lys Ala Met 

260 26S 270 

Lys lie He Arg Ala Arg Val Ala Asp His Phe Ala Gin lie Ala Gin 

275 280 285 

Asp Glu Gin Asp Ala Glu Arg Lys Ser Thr lie Gly Thr Gly Asp Arg 

290 -jo* 

295 300 

Ser Glu Arg lie Arg Thr Tyr Asn Phe Pro Gin Asn Arg Val Thr Asp 

305 310 „ e 

315 320 

His Arg lie Gly Leu Thr Leu Gin Lys Leu Asp Thr He Leu Ser Gly 

325 330 335 

Lys Leu Asp Glu Val Val Asp Ala Leu Val Leu Tyr Asp Gin Thr Gin 

340 345 35 0 

Lys Leu Glu Glu Leu Asn Lys 
355 
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(2) INFORMATION FOR SEQ ID NO: 390: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 200 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 390: 

Met Asn Asn Asn Phe Asn Asn Phe Asn Asn Met Asp Asp Leu Phe Asn 

15 10 is 

Gin Leu Met Gly Gly Met Arg Gly Tyr Ser Ser Glu Asn Arg Arg Tyr 

20 25 30 

Leu He Asn Gly Arg Glu Val Thr Pro Glu Glu Phe Ala His Tyr Arg 

35 40 45 

Thr Thr Gly Gin Leu Pro Gly Asn Ala Glu Thr Asp Val Gin Met Pro 

50 55 60 

Gin Gin Ala Ser Gly Met Lys Gin Asp Gly Val Leu Ala Lys Leu Gly 
65 70 75 80 

Arg Asn Leu Thr Ala Glu Ala Arg Glu Gly Lys Leu Asp Pro Val He 

85 90 95 

Gly Arg Asn Lys Glu He Gin Glu Thr Ser Glu He Leu Ser Arg Arg 

100 105 no 

Thr Lys Asn Asn Pro Val Leu Val Gly Asp Ala Gly Val Gly Lys Thr 

115 120 125 

Ala Val Val Glu Gly Leu Ala Gin Ala He Val Asn Gly Asp Val Pro 

I 30 135 140 

Ala Ala He Lys Asn Lys Glu He He Ser He Asp He Ser Gly Leu 
145 150 155 160 

Glu Ala Gly Thr Gin Tyr Arg Gly Ser Phe Glu Glu Asn Val Gin Asn 

155 170 175 

Leu Val Asn Glu Val Lys Glu Ala Gly Asn He He Leu Phe Phe Asp 

180 185 190 

Glu He His Asn Ser Trp Cys Trp 
195 200' 



(2) INFORMATION FOR SEQ ID NO: 391: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 292 amino acids 

361 



WO 97/43303 



PCT/US97/07950 



(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 391: 

Met Gin Gly Gin lie He Lys Ala Leu Ala Gly Phe Tyr Tyr Val Glu 

15 10 is 

Ser Asp Gly Gin Val Tyr Gin Thr Arg Ala Arg Gly Asn Phe Arg Lys 

20 25 30 

Lys Gly His Thr Pro Tyr Val Gly Asp Trp Val His Phe Ser Ala Glu 

35 40 45 

Glu Asn Ser Glu Gly Tyr He Leu Lys He His Glu Arg Lys Asn Ser 

50 55 60 

Leu Val Arg Pro Pro He Val Asn He Asp Gin Pro Val Val He Met 
65 70 75 80 

Ser Val Lys Glu Pro Asp Phe Asn Ser Asn Leu Leu Asp Arg Phe Leu 

85 90 95 

Val Leu Leu Glu His Lys Gly He His Pro He Val Tyr He Ser Lys 

100 105 no 

Met Asp Leu Leu Glu Asp Arg Gly Glu Leu Asp Phe Tyr Arg Gin Thr 

115 120 125 

Tyr Gly Asp He Gly Tyr Asp Phe Val Thr Ser Lys Glu Glu Leu Leu 

130 135 140 

Ser Leu Leu Thr Gly Lys Val Thr Val Phe Met Gly Gin Thr Gly Val 
I 4 * 150 155 160 

Gly Lys Ser Thr Leu Leu Asn Lys Leu Val Pro Asp Leu Asn Leu Glu 

165 170 175 

Thr Gly Glu He Ser Asp Ser Leu Gly Arg Gly Arg His Thr Thr Arg 

180 185 190 

Ala Val Ser Phe Tyr Asn Leu Asn Gly Gly Lys He Ala Asp Thr Pro 

195 200 205 

Gly Phe Ser Ser Leu Asp Tyr Glu Val Ser Arg Ala Glu Asp Leu Asn 

210 215 220 

Gin Ala Phe Pro Glu He Ala Thr Val Ser Arg Asp Cys Lys Phe Arg 
225 230 235 240 

Thr Cys Thr His Thr His Glu Pro Ser Cys Ala Val Lys Pro Ala Val 

245 250 255 

Glu Glu Gly Val He Ala Thr Phe Arg Phe Asp Asn Tyr Leu Gin Phe 
260 265 270 
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Leu Ser Glu He Glu Asn Arg Arg Glu Thr Tyr Lys Lys Val Ser Lys 
275 280 285 

Lys He Pro Lys 
290 



(2) INFORMATION FOR SEQ ID NO: 392: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 305 amino acids 

(B) TYPE: amino acid 

(C) STRANDSDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 392: 

Met He He Thr He Pro He Lys Asn Gin Lys Asp He Gly Thr Pro 

1 5 10 15 

Ser Asp Ser Val Val Val Leu Gly Tyr Phe Asp Gly He His Lys Gly 

20 25 30 

His Gin Glu Leu Phe Arg Val Ala Asn Lys Ala Ala Arg Lys Asp Leu 

35 40 45 

Leu Pro He Val Val Met Thr Phe Asn Glu Ser Pro Lys He Ala Leu 

50 55 go 

Glu Pro Tyr His Pro Asp Leu Phe Leu His He Leu Asn Pro Ala Glu 
65 7 ° 75 so 

Arg Glu Arg Lys Leu Lys Arg Glu Gly Val Glu Glu Leu Tyr Leu Leu 

85 go 95 

Asp Phe Ser Ser Gin Phe Ala ser Leu Thr Ala Gin Glu Phe Phe Ala 

100 105 110 

Thr Tyr He Lys Ala Met Asn Ala Lys He He Val Ala Gly Phe Asp 

115 120 125 

Tyr Thr Phe Gly Ser Asp Lys Lys Thr Ala Glu Asp Leu Lys Asp Tyr 

130 • «5 140 

Phe Asp Gly Glu Val He He Val Pro Pro Val Glu Asp Glu Lys Gly 
145 15 ° 155 160 

Lys He Ser Ser Thr Arg He Arg Gin Ala He Leu Asp Gly Asn Val 

165 170 175 

Lys Glu Ala Gly Lys Leu Leu Gly Ala Pro Leu Pro Ser Arg Gly Met 

180 185 190 

Val Val His Gly Asn Ala Arg Gly Arg Thr He Gly Tyr Pro Thr Ala 
I 95 200 205 
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Asn Leu Val Leu Leu Asp Arg Thr Tyr Met Pro Ala Asp Gly Val Tyr 

210 215 220 

Val Val Asp Val Glu He Gin Arg Gin Lys Tyr Arg Ala Met Ala Ser 
225 230 235 240 

Val Gly Lys Asn Val Thr Phe Asp Gly Glu Glu Ala Arg Phe Glu Val 

245 250 255 

Asn He Phe Asp Phe Asn Gin Asp He Tyr Gly Glu Thr Val Met Val 

260 265 270 

Tyr Trp Leu Asp Arg He Arg Asp Met Thr Lys Phe Asp Ser Val Asp 

275 280 285 

Gin Leu Val Asp Arg Leu Lys Ala Asp Glu Glu Val Thr Arg Asn Trp 
290 295 300 

Ser 



(2) INFORMATION FOR SEQ ID NO: 393: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 129 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 393 : 

Met Ala Thr Thr Glu Leu He Glu Leu Ala He Glu Thr Ser Lys His 

15 10 15 

Ala Tyr Val Pro Tyr Ser His Phe Pro He Gly Ala Val Leu Val Ala 

20 25 30 

Lys Asp Gly Asn Val Tyr Thr Gly Val Asn He Glu Asn Ala Ser Tyr 

35 40 45 

Pro Leu Thr Asn Cys Gly Glu Arg Thr Ala He Phe Lys Ala He Ser 

50 55 60 

Glu Gly Gin Arg Glu Phe Ser Glu Leu He Val Tyr Gly Gin Thr Glu 
65 70 75 80 

Lys Pro He Ser Pro Cys Gly Ala Cys Arg Gin Val Met Val Glu Phe 

85 90 95 

Phe Glu Gin Asp Leu Lys Val Thr Leu Val Ala Lys Asp Lys Ser Thr 



305 



100 



105 



110 
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Val Glu Met Thr Val Gly Glu Leu Leu Pro Tyr Ser Phe Thr Asp Leu 
115 120 125 

Asn 



(2) INFORMATION FOR SEQ ID NO: 394: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 119 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 94: 

Met Asp Leu Met Arg Thr Phe Asp Lys Val Asp Ala Leu Leu Asp Gin 

15 10 15 

Leu Ser Ala Asn Ser Lys Asp Lys Thr Ala Leu Leu Glu Ser Thr Lys 

20 25 30 

Ala lie Leu Val Leu Asn His Gin lie Gin Ser Lys Ala Ser Ala Ser 

35 40 45 

Glu Glu Thr Ser Pro Ala Arg Asn Ala Glu Ala Asn Gly Asp Asn Thr 

50 55 60 

Ser Ala Glu Asn Gin Pro Asn Ala Thr Ala Glu Ser Asn lie Glu Thr 
65 70 75 80 

Ala Ser Asp Glu Asn Lys Pro Ser Asn Thr Arg Asp Ser Lys Pro Ala 

85 90 95 

Glu Ser Thr Ser Glu Asn Lys Thr Thr Glu Ser Ser Thr Thr Thr Gly 

100 105 no 

Asn Gin Glu Lys Pro Val Glu 
115 

(2) INFORMATION FOR SEQ ID NO: 395 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 118 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Protein 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 395: 

Met Val Arg Tyr Val lie Glu Lys Ser Gly His Thr Asp Trp Asp Gly 

15 10 is 

Arg Lys lie Tyr Gin Glu Ala Ala Ala Gly Asn Ala Leu Cys Gin Glu 

20 25 30 

Ala He Glu Arg Met Asn Arg Asn Leu Ala Gin Gly Leu Leu Asn He 

35 40 45 

Gin Tyr Leu He Asp Pro Asp Val He Ser Leu Gly Gly Ser He Ser 

50 55 60 

Gin Asn Pro Asp Phe He Gin Asp Val Lys Lys Ala Val Asp Asn Phe 
65 70 75 80 

Val Asp Thr Tyr Glu Glu Tyr Thr Val Ala Pro Val He Gin Ala Cys 

85 90 95 

Thr Tyr His Ala Asp Ala Asn Leu Tyr Gly Ala Leu Val Asn Trp Leu 

100 105 no 

Gin Glu Glu Lys Gin Trp 



(2) INFORMATION FOR SEQ ID NO: 3 96: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 91 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 396: 

Met Gin Gly Asp Phe Gin Ser Asn He Tyr Glu Gin Val Glu Gly His 

15 io 15 

Phe Asp His Val He Ser Asn Pro Pro He Arg Ala Gly Lys Gin Val 

20 25 30 

Val His Glu He He Glu Lys Ser Lys Asp Phe Leu Glu Thr Gly Gly 

35 40 45 

Asp Leu Thr He Val He Gin Lys Lys Gin Gly Ala Pro Ser Ala Lys 

50 55 60 

Ser Lys Met Glu Asp Val Phe Gly Asn Cys Glu lie Leu Lys Lys Asp 
65 70 75 80 



115 
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Lys Gly Tyr Tyr lie Leu Arg Ser Val Lys Glu 
85 90 

(2) INFORMATION FOR SEQ ID NO: 3 97: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 119 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 397: 

Met Asp Gly Val Leu Arg Asn Leu Leu Phe Gly Gin Tyr Glu Gly Trp 

1 5 10 is 

Arg Thr Glu Phe His Gly Glu His Ala Leu Gly Lys Asp Ser Ser Gin 

20 25 30 

Tyr He Leu Thr Asp Gin Trp Lys Phe He Trp Phe Pro Val Leu Asn 

35 40 45 

His Tyr Gin Leu Phe Asp Met Lys Lys Asp Pro His Glu Met Asn Asp 

50 55 60 

Leu Tyr Pro Ser Glu Lys Tyr Gin Pro He Val Arg Gin Met Lys Lys 
65 70 75 80 

Lys Leu Val Asp Phe Leu Arg Tyr Arg Glu Glu Gly Phe Val Val Asp 

85 90 95 

Glu Glu Leu Val Pro Val Glu Leu Ser Lys He Thr Pro Thr Leu Thr 

100 105 HO 

Lys Thr Gly Asp Ser Gin Ser 
115 

(2) INFORMATION FOR SEQ ID NO: 398: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 358 amino acids 

(B) TYPE: amino acid 

<C> STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 98: 
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Leu Phe Ser lie He Leu Ser Val Ala Leu He Ser Leu Val Ser Leu 

1 5 10 15 

Val Thr Gly Pro He Leu Pro Phe Leu Ser Lys Pro Ala Ser Gin Asp 

20 25 30 

Cys Asp Xle Phe He Ala Leu Thr Glu His Asp Glu Val Asn Met He 

35 40 45 

Ala Ala Val Leu Ala Lys Lys Met Gly Ala Lys Glu Thr He Val Arg 

50 55 60 

Val Arg Asn Pro Glu Tyr Ser Asn Ser Tyr Phe Lys Glu Lys Asn He 
65 7 0 75 80 

Leu Gly Phe Ser Leu He Val Asn Pro Glu Leu Leu Ala Ala Arg Ala 

85 go 95 

He Ala Asn He He Asp Phe Pro Asn Ala Leu Ser Val Glu Arg Phe 

100 105 no 

Ala Gly Gly Arg Val Ser Leu Met Glu Phe Val Val Lys Ser Thr Ser 

US 120 125 

Gly Leu Cys Gin Met Pro He Ser Asp Phe Arg Lys Lys Phe Gly Asn 
130 i3s 14Q 

Val He Val Cys Ala He Glu Arg Asp His Gin He He He Pro Ser 
145 150 155 160 

Gly Asp Met Thr Val Gin Asp Lys Asp Arg He Phe Val Thr Gly Asn 

165 170 175 

Arg Val Asp Met He Leu Phe His Asn Tyr Phe Lys Ser Arg Ala Val 

180 185 190 

Lys Ser Leu Leu He Val Gly Ala Gly Arg He Thr Tyr Tyr Leu Leu 

195 200 205 

Gly He Leu Lys Asp Ser Arg He Asp Thr Lys Val He Glu He Asn 

210 215 220 

Pro Glu He Ala Ser Phe Phe Ser Glu Lys Phe Pro Asn Leu Tyr He 
225 230 235 240 

Val Gin Gly Asp Gly Thr Ala Lys Asp He Leu Leu Glu Glu Ser Ala 

245 250 255 

Gin His Tyr Asp Ala Val Ala Thr Leu Thr Gly Val Asp Glu Glu Asn 

260 265 270 

Leu He Thr Ser Met Phe Leu Asp Arg Val Gly Val Gin Lys Asn He 

275 280 285 

Thr Lys Val Asn Arg Thr Ser Leu L°u Glu He He Asn Ala Pro Asp 

290 295 300 

Phe Ser Ser He He Thr Pro Lys Ser He Ala Val Asp Thr He Met 
305 310 315 320 

His Phe He Arg Gly Arg Val Asn Ala Gin Tyr Ser Arg Pro Ser Ser 
325 330 335 
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His Ala Pro Ser Ser Gin Trp Pro Asn Arg Asn Pro Ala lie Pro Tyr 

340 345 350 

Gin Gly Lys Pro lie Lys 
355 



(2) INFORMATION FOR SEQ ID NO: 399: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 241 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 399: 



Met Leu Phe Thr Ala Leu Gin Met Asn Thr Asp He Leu Ala He Ser 

1 5 io is 

Gin Glu Val Gly Asp Trp Arg He Asp Leu Ala Ser Ser Gin Thr Glu 

20 25 30 

Met Gin Leu Ala Thr Ser Phe He Ser Pro Ser Gin Ala Leu Leu Asn 

35 40 45 

Leu Pro Gin Glu Asp Phe Asp Ser Cys Lys Ser Ser Ala Gin Ala Asp 

50 55 60 

Trp Glu Asn Leu Leu His Arg Phe Asp He He Glu He Gly Glu Ala 
65 70 75 80 

Asp Arg Thr Phe Phe Asp His Cys Leu Tyr Arg Leu Phe Leu Phe Pro 

85 90 95 

Gin Thr Phe Tyr Glu He Asn Glu Ser Gly Gin Ala He His Met Asp 

100 105 no 

Leu Ala Thr Gly Thr Val Lys Pro Gly Val Leu Phe Ser Asn Asn Gly 

115 120 125 

Phe Trp Asp Thr Phe Arg Thr Thr Phe Pro Leu Phe Ala Leu He He 

130 135 140 

Pro Glu His Tyr Gin Arg Phe Leu Glu Gly Phe Leu Asn Ser Tyr Arg 
145 150 155 160 

Asp Thr Gly Phe Leu Pro Lys Trp Leu Ala Pro Asp Glu Arg Gly Met 

165 170 175 

Met Pro Gly Thr Leu Leu Asp Gly He He Ala Asp Ser Ala Cys Lys 

180 185 190 

Asp Met Ala Pro Asp Leu Glu Gly Glu Leu Phe Gin Ala Met Leu Arg 
195 200 205 

369 



Asn Ser Gin Gin Ser Arg Pro Ser Arg Asn Gin Trp Pro Pro Arg Thr 

210 215 220 

Ser Pro lie Pro Arg Thr Arg Leu Pro Leu Tyr Arg Pro Pro Arg Lys 

225 2 30 2 35 240 

Cys 



(2) INFORMATION FOR SEQ ID NO: 400: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 146 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 400: 

Met Met Val Ser His Ala Cys Ala Val Lys Leu Tyr Lys Glu Lys Gly 

1 5 10 15 

Tyr Lys Gly Glu He Gly Val Val His Ala Leu Pro Thr Lys Tyr Pro 

20 25 30 

Leu Asp Pro Glu Asn Pro Ala Asp Val Arg Ala Ala Glu Leu Glu Asp 

35 40 45 

He He His Asn Lys Phe He Leu Asp Ala Thr Tyr Leu Gly Arg Tyr 

50 55 go 

Ser Ala Glu Thr Met Glu Gly Val Asn His He Leu Ser Val Asn Gly 
65 70 75 80 

Gly Ser Leu Asp Leu Arg Glu Glu Asp Phe Thr Ala Leu Glu Ala Ala 

85 90 95 

Lys Asp Leu Asn Asp Phe Leu Gly He Asn Tyr Tyr Met Ser Asp Trp 

!00 105 no 

Met Glu Ala Phe Asp Gly Glu Thr Glu He He His Asn Gly Lys Gly 

120 125 
Glu Lys Gly Ser Ser Lys Tyr Gin He Lys Gly He Gly Arg Leu Val 

"0 135 140 

Ala Pro 
145 

(2) INFORMATION FOR SEQ ID NO: 401: 



(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 183 amino acids 
<B) TYPE: amino acid 
(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 401: 



Met Asn Thr Ser Leu Lys Leu Ser Lys Gin Leu Ser Phe Gly Glu Glu 

15 io is 

lie Ala Asn Ser Val Thr His Ala Val Gly Ala Val lie Met Leu He 

20 25 30 

Leu Leu Pro He Ser Ser He Tyr Ser Tyr Glu Ala His Gly Phe Leu 

35 40 45 

Ser Ser He Gly Val Ser He Phe Val He Ser Leu Phe Leu Met Phe 

50 55 60 

Leu Ser Ser Thr He Tyr His Ser Met Ala Tyr Gly Ser Thr His Lys 
65 70 75 80 

Tyr Val Leu Arg He He Asp His Ser Met He Tyr Val Ala He Ala 

85 90 95 

Gly Ser Tyr Thr Pro Val Val Leu Thr Leu Met Asn Asn Trp Phe Gly 

100 105 HO 

Tyr Leu He He Val He Gin Trp Gly Thr Thr He Phe Gly He Leu 

115 120 125 

Tyr Lys He Phe Ala Lys Lys Val Asn Glu Lys Phe Ser Leu Ala Leu 

130 135 140 

Tyr Leu He Met Gly Trp Leu Val Leu Ala He He Pro Ala He He 
"5 150 155 160 

Ser Gin Xaa Thr Pro Val Phe Trp He Leu Met Val Thr Gly Gly Leu 

165 170 175 

Cys Tyr Thr Val Gly Ala Asp 
180 

(2) INFORMATION FOR SEQ ID NO: 402 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 96 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 402: 

Met Leu Glu Asn Leu Cys Glu Leu Pro Phe Arg Pro Glu Ala Gin Arg 

15 10 15 

Val Val Thr Lys Gly Tyr Leu Asn Leu Gin Asn Arg Asn Asp Leu Leu 

20 25 30 

Leu Val Glu Asp lie Thr Ala Asp Glu Trp Met Asp Val Gin Phe Glu 

35 40 45 

Leu Gin Pro Thr He Tyr Lys Leu Lys Glu Gly Asp Thr Leu Arg Leu 

50 55 60 

Val Leu Tyr Thr Thr Asp Phe Glu He Thr He Arg Asp Asn Thr Asp 
65 ™ 75 so 

Tyr His Leu Thr Val Asp Leu Ala Gin Ser Met Leu Thr Leu Pro Cys 
85 90 95 

(2) INFORMATION FOR SEQ ID NO: 403: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 116 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 403: 

Met Asn Lys Ser Glu His Arg His Gin Leu He Arg Ala Leu He Thr 

1 5 io 15 

Lys Asn Lys He His Thr Gin Ala Glu Leu Gin Ala Leu Leu Ala Glu 

20 25 30 

Asn Asp He Gin Val Thr Gin Ala Thr Leu Ser Arg Asp He Lys Asn 

35 40 45 

Met Asn Leu Ser Lys Val Arg Glu Glu Asp Ser Ala Tyr Tyr Val Leu 

50 55 60 

Asn Asn Gly Ser He Ser Lys Trp Glu Lys Arg Leu Glu Leu Tyr Met 
65 70 75 80 

Glu Asp Ala Leu Val Trp Met Arg Pro Val Gin His Gin Val Leu Leu 

85 90 95 

Lys Thr Leu Pro Gly Leu Ala Gin Ser Phe Gly Ser He He Asp Asp 
100 105 no 
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Phe Glu Leu Pro 
115 

(2) INFORMATION FOR SEQ ID NO: 404: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 73 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 404: 

Met Gly Lys Leu Ser Ser He Leu Leu Gly Thr Val Ser Gly Ala Ala 

1 5 10 15 

Leu Ala Leu Phe Leu Thr Ser Asp Lys Gly Lys Gin Val Cys Ser Gin 

20 25 30 

Ala Gin Asp Phe Leu Asp Asp Leu Arg Glu Asp Pro Glu Tyr Ala Lys 

35 40 45 

Glu Gin Val Cys Glu Lys Gin Thr Glu Val Lys Glu Gin Ala Thr Asp 

50 55 60 

Phe Val Leu Asn Asn Lys Arg Thr Gly 
65 70 

(2) INFORMATION FOR SEQ ID NO: 405: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 89 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 405: 

Met Tyr Leu Gly Asp Leu Met Glu Lys Ala Glu Cys Gly Gin Phe Ser 

15 10 15 

He Leu Ser Phe Leu Leu Gin Glu Ser Gin Thr Thr Val Lys Ala Val 

20 25 30 
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Met Glu Glu Thr Gly Phe Ser Lys Ala Thr Leu Thr Lys Tyr Val Thr 
35 40 45 

Leu Leu Asn Asp Lys Ala Leu Asp Ser Gly Leu Glu Leu Thr lie His 

50 55 60 

Ser Glu Asp Glu Asn Leu Arg Leu Ser lie Gly Ala Ala Thr Lys Gly 
65 70 75 80 

Arg Asp lie Arg Ser Leu Phe Phe Gly 
85 

(2) INFORMATION FOR SEQ ID NO: 406: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 260 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 406: 

Met Arg Phe Tyr Phe Met Glu Asn Val Val Val His lie lie Ser His 

1 5 io 15 

Ser His Trp Asp Arg Glu Trp Tyr Leu Pro Phe Glu Ser His Arg Met 

20 25 30 

Gin Leu Val Glu Leu Phe Asp Asn Leu Phe Asp Leu Phe Glu Asn Asp 

35 40 45 

Pro Glu Phe Lys Ser Phe His Leu Asp Gly Gin Thr lie Val Leu Asp 

50 55 60 

Asp Tyr Leu Gin lie Arg Pro Glu Asn Arg Asp Lys Val Gin Arg Tyr 
65 70 75 80 

He Asp Glu Gly Lys Leu Lys He Gly Pro Phe Tyr He Leu Gin Asp 

85 90 95 

Asp Tyr Leu He Ser Ser Glu Ala Asn Val Arg Asn Thr Leu He Gly 

100 105 no 

Gin Gin Glu Ala Ala Lys Trp Gly Lys Ser Thr Gin He Gly Tyr Phe 

H5 120 125 

Pro Asp Thr Phe Gly Asn Met Gly Gin Ala Pro Gin He Leu Gin Lys 

130 135 140 

Ser Gly He His Val Ala Ala Phe Gly Arg Gly Val Lys Pro He Gly 
145 150 155 160 

Phe Asp Asn Gin Val Leu Glu Asp Glu Gin Phe Thr Ser Gin Phe Ser 
165 170 175 
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Glu Met Tyr Tip Gin Gly Val Asp Gly Ser Arg Val Leu Gly He Leu 

180 185 190 

Phe Ala Asn Trp Tyr Ser Asn Gly Asn Glu lie Pro Val Asp Lys Asp 

195 200 205 

Glu Ala Leu Thr Phe Trp Lys Gin Lys Leu Ser Asp Val Arg Cys Leu 

210 215 220 

Arg Phe Asp Gin Pro Met Val Asp Asp Glu Thr Ala Val Thr Thr Ser 
225 230 235 240 

Leu Ser Gin Glu Lys Ser Glu Arg Ser His Ser Val Leu Gin Met Asn 
245 250 255 

Ser Ser Arg Met 
260 



(2) INFORMATION FOR SEQ ID NO: 407: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 125 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 407: 

Leu Thr Ala Ser Pro Leu Lys Lys Ser He Lys Lys Lys Asn Arg Lys 

1 5 10 15 

Leu Thr Asn Lys Xaa Glu Lys His Met Ser Lys He He Gly He Asp 

20 25 30 

Leu Gly Thr Thr Asn Ser Ala Val Ala Val Leu Glu Gly Thr Glu Ser 

35 40 45 

Lys He He Ala Asn Pro Glu Gly Asn Arg Thr Thr Pro Ser Val Val 

50 55 60 

Ser Phe Lys Asn Gly Glu He He Val Gly Asp Ala Ala Lys Arg Gin 
65 70 75 ao 

Ala Val Thr Asn Pro Asp Thr Val He Ser He Lys Ser Lys Met Gly 

85 90 95 

Thr Ser Glu Lys Val Ser Ala Asn Gly Lys Glu Tyr Thr Pro Thr Arg 

100 105 no 

Asn Leu Ser Tyr Asp Pro Ser He Leu Glu Arg Leu Arg 
115 120 125 



(2) INFORMATION FOR SEQ ID NO: 408: 
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Leu 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 305 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

<ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:408: 

Met Asp ser Thr Pro Val lie Xaa Gly Ala Met Leu He Ser Pro 

Met Thr Pro He Leu Gly Val Gly Leu Ser Leu Ala lie Phe Asp Phe 

20 25 30 

Lys Leu Leu Arg Lys Ser Phe Lys He Leu Ala He Gin He Leu Ala 

35 40 45 

ser Leu He Ala Ser Thr Leu Tyr Phe Tyr Leu Ser Pro He Ser Tyr 

50 55 60 

Ala Ser Ser Glu He Val Ala Arg Thr Ser Pro Thr He Trp Asp Val 
65 70 75 so 

Leu He Ala Phe Val Gly Gly He Ala Gly He He Gly Ala Arg Lys 

fl 5 90 95 

Lys Glu Thr Asn Asn He Val Pro Gly Val Ala He Ala Thr Ala Leu 

100 105 no 

Met Pro Pro Leu Cys Thr Val Gly Tyr Ala He Ala Ser Ala Asn Leu 

I 15 120 125 

Lys Phe He He Gly Ser Ser Tyr Leu Phe Leu He Asn Cys Ser Phe 

130 135 140 

He Val He Ala Thr Tyr He Gly Val Arg Leu Met Met Val Lys Lys 
145 I" 155 160 

His Tyr Phe Lys Asp Asn Glu Glu Asp Ser Lys Met Arg Arg He Leu 

165 170 175 

Leu Leu Val Ala Val Leu Leu Met He Pro Ser Phe He Ser Ala Thr 

180 185 190 

Thr Leu Val Arg Glu Thr Leu Lys Lys Glu Ser Leu Lys Lys Phe He 

I 95 200 205 

Ser Glu Gin Phe Gin Gly His Asn He Leu Lys Lys Thr Tyr Ser Lys 

210 215 220 

Lys Thr His Thr Leu Lys Leu Thr He Ser Gly Asn Tyr Leu Thr Glu 

225 230 „c 

" u 235 240 

Glu Glu Leu Asp Met He Ser Ser Lys Arg Gly Asp Tyr Gly Leu Ser 
245 250 255 
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Asp Val Ser Val Gin Val Ser Gin Leu Ser Asp Ser Glu Gin Leu Ser 

260 265 270 

Lys Glu Glu Leu Val Glu Tyr Phe Phe Gin Tyr He Lys Asp Lys Glu 

275 280 285 

Ala Lys Glu Lys Glu Lys Ala Asn Lys Phe Tyr Thr Glu Ser Glu Glu 
290 295 300 

Gin 
305 



(2) INFORMATION FOR SEQ ID NO: 409: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 73 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 409: 



Met He Ala Arg Ser Tyr Gin Glu Met Asp Asp Leu Asp Thr Ala Tyr 

1 5 10 15 

Glu His Tyr Gin Glu Leu Thr Gly Asp Leu Lys Asp Asn Pro Glu Phe 

20 25 30 

Leu Glu His Tyr He Tyr Leu Leu Arg Glu Leu Gly His Phe Glu Glu 

35 40 45 

Ala Lys val His Ala His Thr Tyr Leu Lys Leu Val Pro Asp Asp Val 

50 55 60 

Gin Met Gin Glu Leu Phe Glu Arg Leu 
65 to 



(2) INFORMATION FOR SEQ ID NO: 410: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 191 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 10: 
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Met Leu Glu Glu Thr Gin Glu He Val Ala Gly Arg Val Ala Ser Val 

15 10 15 

Glu Thr Leu Lys Arg He Glu Glu Leu Gly Phe Asp Phe Val Cys Leu 

20 25 30 

Thr Gly Asn Pro Gly Thr Gly Val Ser Asn Arg Glu He He Lys Ala 

35 40 45 

Val Gin Thr Ala Lys Glu Asn Phe Ser Gly Leu He lie Ala Gly Lys 

50 55 60 

Met His Gly Ala Gly Val Asn Glu Pro Val Ala Glu Leu Ser Val Ala 
65 70 75 80 

Glu Gin Leu Leu Glu Ala Gly Ala Asp Val lie Leu Val Pro Ala Val 

85 90 95 

Gly Thr Val Pro Ala Phe His Asp Gin Glu Leu Arg Glu Val Val Asp 

100 105 no 

Leu Val His Ser Lys Gly Gly Leu Val Leu Ser Ala He Gly Thr Ser 

H5 120 125 

Gin Glu Thr Ser Asp Thr Asp Thr lie Lys Glu lie Ala Leu Arg Asn 

130 135 140 

Lys He Cys Gly Val Asp He Gin His lie Gly Asp Ala Gly Tyr Gly 
145 150 155 160 

Gly Leu Ala Thr Val Asp Asn lie Tyr Ala Leu Ser Lys Ala He Arg 

165 170 175 

Gly Val Arg His Thr Val Ser Arg Leu Ala Arg Ser Val Asn Arg 
180 185 190 

(2) INFORMATION FOR SEQ ID NO: 411: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 105 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY.: linear 

(ii) MOLECULE TYPE: Protein 

{xi) SEQUENCE DESCRIPTION: SEQ ID NO:411: 

Met Ala Lys Val Thr lie Met Leu Ala Cys Ala Ala Gly Met Ser Thr 

15 io is 

Ser Leu Leu Val Thr Lys Met Gin Lys Ala Ala Glu Asp Lys Gly Leu 
20 25 30 
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Asp Ala Glu lie Phe Ala Val Pro Ala Pro Glu Ala Glu Glu He Val 
35 40 45 

Ala Thr Lys Glu Val Asn Val Leu Leu Leu Gly Pro Gin Val Arg Tyr 

50 55 60 

Leu Leu Gly Asp Phe Gin Glu Lys Leu Lys Asp Arg Gin He Pro Val 
65 7 ° 75 so 

Ala Val He Pro Met Thr Asp Tyr Gly Met Met Asn Gly Ser Lys Val 

fl 5 90 95 

Leu Asp Leu Ala Glu Ser Leu Leu Asp 
100 10 5 



(2) INFORMATION FOR SEQ ID NO: 412: 

(i) SEQUENCE CHARACTERISTICS: 
IA) LENGTH: 72 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 412: 

Met Leu Gin Arg Gin Gin Ala Ser Ala He lie Asp Ala Arg Lys Met 

He Val Asp Gly Ala Val Gly Met Val Glu Met Ala Leu Glu Arg Leu 

20 25 30 

Asn Glu Gly Glu Leu Val Glu Leu Asp Glu Glu Arg Lys Ala Ala Met 

35 40 45 

Val Ser Asn Leu Leu Val Val Leu Cys Gly Asn His Asp Ala Gin Pro 

50 55 so 

He val Asn Thr Gly Ser Leu Tyr 
65 70 



(2) INFORMATION FOR SEQ ID NO: 413: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Protein 

379 



WO 97/43303 



PCT/US97/07950 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:413: 



Met Val Ala Gly Phe Thr Gly Glu 

1 5 
Glu Ala Phe Lys Trp Gly Val Ala 
20 

Asp Asp Leu Ala Thr Ala Glu Phe 

35 40 
Glu Val Glu Lys Arg 
50 



Phe Val Lys Ser Lys Asp Ala Val 

10 15 
Cys Gly Thr Ala Thr Thr Phe Ser 
25 30 
lie Lys Glu Thr Tyr Gly Lys Val 
45 



(2) INFORMATION FOR SEQ ID NO: 414: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 127 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 414: 

Met Tyr Glu Ser Pro Val Gly Phe Arg His Gly Pro Lys Ser Leu He 

15 10 15 

Asn Asp Asn Thr Val Val Leu Val Phe Gly Thr Thr Thr Asp Tyr Thr 

20 25 30 

Arg Lys Tyr Asp Leu Asp Leu Val Arg Glu Val Ala Gly Asp Gin He 

35 40 45 

Ala Arg Arg Val Val Leu Leu Ser Asp Gin Ala Phe Gly Leu Glu Asn 

50 55 6 o 

Val Lys Glu Val Ala Leu Gly Cys Gly Gly Val Leu Asn Asp lie Tyr 
65 70 75 80 

Arg Val Phe Pro Tyr lie Val Tyr Ala Gin Leu Phe Ala Leu Leu Thr 

85 90 95 

Ser Leu Lys Val Glu Asn Lys Pro Asp Thr Pro Ser Pro Thr Gly Thr 

100 105 no 

Val Asn Arg Val Val Gin Gly Val He He His Glu Tyr Gin Lys 
115 120 125 



(2) INFORMATION FOR SEQ ID NO: 415: 
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<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 101 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 



(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 415: 

Met Ser His Asp His Asn His Asp His Glu Glu Arg Glu Leu He Thr 

15 10 15 

Leu Val Asp Glu Gin Gly Asn Glu Thr Leu Phe Glu He Leu Leu Thr 

20 25 30 

He Asp Gly Lys Glu Glu Phe Gly Lys Asn Tyr Val Leu Leu Val Pro 

35 40 45 

Val Asn Ala Glu Glu Asp Glu Asp Gly Gin Val Glu He Gin Ala Tyr 

50 55 60 

Ser Phe He Glu Asn Glu Asp Gly Thr Glu Gly Glu Leu Gin Pro He 
65 70 75 80 

Pro Glu Asp Ser Glu Asp Glu Trp Asn Met He Glu Glu Val Phe Asn 

85 90 95 

Ser Phe Met Glu Glu 
100 



(2) INFORMATION FOR SEQ ID NO: 41 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 98 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:416: 

Met Glu Met Gin Thr Ser Ala Gin Leu Leu Thr Asn Lys He Phe Leu 

15 10 15 

Lys Asn Pro Leu Lys Ala Leu Val Glu Glu Lys Tyr Gly He Glu Tyr 

20 25 30 

Glu Glu Phe Thr Asn Pro Trp His Ala Ala He Ser Ser Phe Val Ala 
35 40 45 
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Phe Phe Leu Arg Ser Leu Pro Pro Met Leu Ser Val Thr lie Phe Pro 

50 55 60 

Ser Glu Tyr Arg He Pro Ala Thr Val Leu He Val Gly Val Ala Leu 
65 70 75 eo 

Leu Leu Thr Gly Tyr Thr Ser Ala Arg Leu Gly Lys Asp Pro Thr Arg 
85 90 95 

Thr Val 



(2) INFORMATION FOR SEQ ID NO: 417: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 226 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
{D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:417: 

Met Pro Trp Asn Ala Ala Tyr Val Glu Pro Ser Arg Arg Pro Ala Asp 

1 5 io 15 

Gly Arg Tyr Gly Glu Asn Pro Asn Arg Leu Tyr Gin His His Gin Phe 

20 25 30 

Gin val Val Met Lys Pro Ser Pro Ser Asn He Gin Glu Leu Tyr Leu 

35 40 45 

Glu Ser Leu Glu Lys Leu Gly He Asn Pro Leu Glu His Asp He Arg 

50 55 60 

Phe Val Glu Asp Asn Trp Glu Asn Pro Ser Thr Gly Ser Ala Gly Leu 
65 7 ° 75 so 

Gly Trp Glu Val Trp Leu Asp Gly Met Glu He Thr Gin Phe Thr Tyr 

85 go 95 

Phe Gin Gin Val Gly Gly Leu Ala Thr Gly Pro Val Thr Ala Glu Val 

100 105 no 

Thr Tyr Gly Leu Glu Arg Leu Ala Ser Tyr He Gin Glu Val Asp Ser 

115 120 125 

Val Tyr Asp He Glu Trp Ala Asp Gly Val Lys Tyr Gly Glu He Phe 

130 135 140 

He Gin Pro Glu Tyr Glu His Ser Lys Tyr Ser Phe Glu He Ser Asn 

1*5 150 ,ec 

" w 155 160 

Gin Glu Met Leu Leu Glu Asn Phe Asp Lys Phe Glu Lys Glu Ala Gly 
165 170 175 
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Arg Ala Leu Glu Glu Gly Leu Val His Pro Ala Tyr Asp Tyr Val Leu 

180 185 190 

Lys Cys Ser His Thr Phe Asn Leu Leu Asp Ala Arg Gly Ala Val Ser 

195 200 205 

Val Thr Glu Pro Cys Arg Leu Tyr Arg Ser Val Ser Val Thr Trp Pro 

210 215 220 

Val Leu 
225 

(2) INFORMATION FOR SEQ ID NO: 418: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 49 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 418: 

Met Ser Lys Glu Leu Thr Phe Gin Glu He He Leu Thr Leu Gin Gin 

1 5 10 15 

Phe Trp Asn Asp Gin Gly Cys Met Leu Met Gin Ala Tyr Asp Asn Glu 

20 25 30 

Lys Gly Ala Gly Thr Met Ser Pro Tyr Thr Phe Leu Arg Ala Tyr Arg 
35 40 45 

Thr 



(2) INFORMATION FOR SEQ ID NO: 419 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 147 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 419 : 
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Met Phe lie Thr Met Gly Val Gly Ala Ala lie Leu Leu Leu lie Trp 

lie Phe Phe Lys Gin Leu Leu He Thr Ser Phe Asp Glu Leu Leu Ala 

20 25 30 

Lys Ala Met Gly Met Pro Val Asn Phe Tyr His Tyr Leu Leu Met Val 



35 4 o 



45 

Leu Leu Thr Leu Val Ser Val Thr Ala Met Gin Ser Val Gly Thr He 

50 55 60 

Leu He Val Ala Met Leu He Thr Pro Ala Ala Thr Ala Tyr Leu Tyr 
65 70 75 



80 



Ala Asn Ser Leu Lys Ser Met He Phe Leu Ser Ser Thr Phe Gly Ala 

85 SO 95 

Thr Ala Ser Val Leu Gly Leu Phe He Gly Tyr Ser Phe Asn Val Ala 

100 "5 no 

Ala Gly ser Ser He Val Leu Thr Ala Ala Ser Phe Phe Leu He Ser 



115 120 125 



Phe Phe He Ala Pro Lys Gin Arg Tyr Leu Lys Leu Lys Asn Lys His 

130 135 140 

Leu Leu Lys 

145 



(2) INFORMATION FOR SEQ ID NO: 420: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 238 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 420 : 

Met Leu Arg Asn Arg Leu Glu Thr Glu Phe Gly Lys Arg Val Glu Val 

15 10 15 

Val Asp Val lie Gly Tyr Tyr Glu Leu Asn Gin Glu Lys Leu Lys Gly 

20 25 30 

He Asp Phe lie Val Ser Ala Val Asp Leu Ser Asn Leu Tyr Phe Gin 



35 



40 



45 



He Pro Val Phe Lys Val Ser Val Phe Leu Lys Ser Asp Glu Met Glu 

50 55 60 

Met lie Arg Lys Ala Met Val Gin Met Gin Val Ser Ser Tyr Val Gin 

" 70 75 80 
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Ser Ser Lys He Asn Lys Phe Glu Asn Asn Gly Phe Arg Gin Tyr Phe 

85 90 95 

Ser Lys Glu Asn Phe Leu He Cys Thr Glu Ser Asp Lys Val Asn Leu 

100 105 HO 

Leu Glu Lys Met Val Glu Ser Leu Ser Val Gly Glu Ser Asn Glu Phe 

115 120 125 

Glu Gin Ser Leu Leu Tyr Gly He Lys Gin Arg Glu Glu Leu Ser Ser 

130 135 140 

Val Val Phe Ser Glu Lys He Ala Val Pro His Pro He Gin Pro Phe 
145 150 155 160 

Gly Thr Glu Glu Lys Val Ser Val Ala He Cys Lys Asp Ser Leu Leu 

165 170 175 

Trp Asp Asn Gin Ser Ser His Val Gin Thr Ser Tyr Phe Phe Tyr Leu 

180 185 190 

His Gin Tyr Met Gly Thr Glu Gly Leu Ala Thr Val Thr Lys Lys He 

195 200 205 

Val Ser Leu Thr Glu Asn Asp Glu Leu Gin Asn Gin Leu He Ser Cys 

210 215 220 

Asn Asn Phe Glu Asp Phe He Asn He Phe Glu Lys He Lys 
225 230 235 

(2) INFORMATION FOR SEQ ID NO: 421: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:421: 

Met Pro Arg Ala Ser Phe Asp Gin Val Asn Gin Val Arg Gin Glu Asn 

15 io is 

Gly Glu Pro Glu Phe Ala Asn Pro Arg Asn Ala Ala Ala Gly Thr He 

20 25 30 

Val Ser Val Gly Tyr Ser Ser Ser Phe Gin Ala 
35 40 



(2) INFORMATION FOR SEQ rID NO: 422: 



(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 78 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42 2: 
Met Arg Gin Gin Glu Leu Leu Cys Gin Leu Asp Thr Ala Val val Ser 
Lys Arg Asn Leu Ala Thr Phe Leu Tyr Gin Glu Ala Ser Pro Ser Thr 



20 25 



30 

Arg Asp Ser Gin Glu Lys Gly Leu Lys Tyr Leu Glu Gin Leu Gly lie 

35 « 45 

Val val Asn His Lys Arg He Phe Gly Trp Lys lie Asp Lys lie Trp 

50 55 60 

Asn Phe He Gin Glu Val Gly Gin Glu Arg Glu Asn Leu Pro 
65 70 75 

<2) INFORMATION FOR SEQ ID NO: 423: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 151 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Protein 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:423: 
Met Ala Leu Thr Glu Gin Lys Arg Ala Arg Leu Glu Lys Leu Ser Asp 



5 10 15 



Glu Asn Gly He He Ser Ala Leu Ala Phe Asp Gin Arg Gly Ala Leu 

22 25 30 

Lys Arg Leu Met Ala Gin His Gin Thr Glu Glu Pro Thr Val Ala Gin 

35 40 45 

Met Glu Glu Leu Lys Val Le U Val Ala Asp Glu Leu Thr Lys Tyr Ala 



50 55 60 



Ser Ser Met Leu Leu Asp Pro Glu Tyr Gly Leu Pro Ala Thr Lys Ala 

65 70 7 c 

75 80 
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Leu Asp Glu Lys Ala Gly Leu Leu 
85 

Asp Thr Thr Ser Thr Lys Arg Leu 
100 

Ala Lys Arg lie Lys Glu Glu Gly 
115 120 
Tyr Tyr Asp Val Asp Ser Ser Asp 

130 135 
Tyr lie Gly Ala Tyr Arg Phe 
145 150 



Leu Ala Tyr Glu Lys Thr Gly Tyr 

90 95 
Pro Asp Cys Leu Asp Val Trp Ser 
105 no 
Ala Asp Ala Val Lys Phe Leu Leu 
125 

Glu Leu Asn Gin Glu Lys Gin Ala 
140 



(2) INFORMATION FOR SEQ ID NO: 424: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 174 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 424: 



Leu Asn Trp He Leu Leu Gin Gly Leu He Cys Gin Arg Asp Ala Ser 

15 10 is 

Tyr Asp Met Lys Gin Asp Asp Leu Asp Lys Val Ala Asp Tyr Leu Phe 

20 25 30 

Lys Thr Glu Glu Trp Thr Met Tyr Glu Leu He Leu Phe Gly Asn Leu 

35 40 45 

Tyr Ser Phe Tyr Asp Val Asp Tyr Val Thr Arg He Gly Arg Glu Val 

50 55 60 

Met Glu Arg Glu Glu Phe Tyr Gin Glu He Ser Arg His Lys Arg Leu 
65 70 75 80 

Val Leu He Leu Ala Leu Asn Cys Tyr Gin His Cys Leu Glu His Ser 

85 90 95 

Ser Phe Tyr Asn Ala Asn Tyr Phe Glu Ala Tyr Thr Glu Lys lie He 

100 105 no 

Asp Lys Gly He Lys Leu Tyr Glu Arg Asn Val Phe His Tyr Leu Lys 

115 120 125 

Gly Phe Ala Leu Tyr Gin Lys Gly Gin Cys Lys Glu Gly Cys Lys Gin 

130 135 140 

Met Gin Glu Thr Met His He Phe Asp Val Leu Gly Leu Pro Glu Gin 
145 150 155 

387 
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Val Ala Tyr Tyr Gin Glu His Tyr Glu Lys Phe Val Lys Ser 
165 170 



(2) INFORMATION FOR SEQ ID NO: 42 5: 

<i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 149 amino acids 
<B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:425: 

Met Tyr Pro Asp Asp Ser Leu Thr Leu His Thr Asp Leu Tyr Gin lie 

Asn Met Met Gin Val Tyr Phe Asp Gin Gly He Tyr Asn Lys Lys Ala 

20 25 30 

Val Phe Glu Val Tyr Phe Arg Gin Gin Pro Phe Lys Asn Gly Tyr Ala 

35 40 45 

Val Phe Ala Gly Leu Glu Arg lie Val Asn Tyr Leu Glu Asp Leu Arg 

50 55 60 

Phe Ser Asp Ser Asp lie Ala Tyr Leu Glu Ser Leu Gly Tyr His Gly 
" 70 75 80 

Ala Phe Leu Asp Tyr Leu Arg Asn Phe Lys Leu Glu Leu Thr Val Arg 

85 90 95 

Ser Ala Gin Glu Gly Asp Leu Val Phe Ala Asn Glu Pro lie Val Gin 

100 105 i 10 

Val Glu Gly Pro Leu Ala Gin Cys Gin Leu Val Glu Thr Ala Leu Leu 

115 120 125 

Asn lie Val Asn Tyr Gin Thr Leu Val Ala Thr Lys His Ser Tyr Ser 

130 135 140 

Phe Gly Tyr Arg Arg 
145 



(2) INFORMATION FOR SEQ ID NO: 426: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 66 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: Protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 426: 



Met Leu Val Leu Asn Tyr Val Tyr 

1 5 
Ala Asn Phe Asp He Gly Lys He 
20 

Thr Met Val Leu Pro Phe Asn Leu 

' 35 40 
Ser Phe Trp Leu Leu Tyr Val Leu 

50 55 
Glu Arg 
65 



Ala Val Pro Leu Tyr Ala Lys Phe 

10 15 
Leu Gly Leu Ser Asn Tyr Leu Met 
25 30 
He Glu Gly Val He Phe Ser Val 
45 

Leu Lys Pro Thr Leu Lys His Tyr 
60 



(2) INFORMATION FOR SEQ ID NO: 427: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 70 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 427: 

Met Glu Glu Ser Lys Glu Leu Asn Ala Val lie Asp Val He Met Leu 

1 5 io 15 

Ala Gly Thr lie Leu Leu Lys Ser Gly Ser Glu He His Arg Val Glu 

20 25 30 

Asp Thr Met He Arg. He Ala His Ser Gin Gly He Val Asp Cys Asn 

35 40 45 

Val Leu Ala Met Pro Ala Ala He Phe Phe Ser He Glu Asn Thr Asn 

50 55 60 

He Ser Arg Met Asn Ala 
65 70 



(2) INFORMATION FOR SEQ ID NO: 42 8: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH : 104 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 428: 

Leu Phe Glu Lys Lys Gly Leu Thr Val Asp Ala Asn Gly Asn Ala Thr 

15 " 15 

Val Asp Leu Thr Phe Lys Glu Asp Ser Glu Lys Gly Lys Ser Arg Phe 

20 25 30 

Gly Val Ser Leu Lys Phe Lys Asp Thr Asn Asn Asn Val Phe Val Gly 

35 40 45 

Tyr Asp Lys Asp Gly Trp Phe Trp Glu Tyr Lys Ser Pro Thr Thr Ser 

50 55 60 

Thr Trp Tyr Arg Gly Ser Arg Val Ala Ala Pro Glu Thr Gly Ser Thr 
65 7 ° 75 80 

Asn Arg Leu Ser He Ser Leu Gin Val Arg Arg Ser Gly Lys Cys Gin 

85 90 95 

Ala Asn Lys Asp Val Asn Leu Leu 
100 

(2) INFORMATION FOR SEQ ID NO: 42 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 139 amino acids 

(B) TYPE: amino acid 

tC) STRANDEDNESS: single 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 429: 

Met Val Glu Asn Pro Glu Gly Phe His Phe Asp Asp Leu Gin Leu Gin 

Thr His Ala Asp Asn Asp He Glu Ala Leu Val Ser Leu Ala Asn Met 

20 25 30 

Asp Gly Glu Lys Val Glu Phe Asn Ala Thr Gly Gin Gly Ser Val Glu 

35 40 45 

Ala He Phe Asn Ala He Asp Lys Phe Phe Asn Gin Ser Val Arg Leu 
50 55 go 
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Val Ser Tyr Thr lie Asn Ala Val Thr Asp Gly lie Asp Ala Gin Asp 
65 70 75 80 

Arg Val Leu Val Thr Val Glu Asn Arg Asp Thr Glu Thr lie Phe Asn 

85 90 95 

Ala Ala Gly Leu Asp Phe Asp Val Leu Lys Ala Ser Ala lie Ala Tyr 

100 105 110 

lie Asn Ala Asn Thr Phe Val Gin Lys Glu Asn Ala Gly Glu Met Gly 

115 120 125 

Arg Ser Val Ser Tyr His Asp Met Pro Ser Val 
130 135 

(2) INFORMATION FOR SEQ ID NO: 430: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 193 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 430 : 

Met Ala Lys Lys Lys He Lys Lys Glu Lys He Asp Asn Val Gly He 

15 10 15 

His Ser Phe Ser Lys Lys Ala Asp He Phe Phe Ser He He Ser Gly 

20 25 30 

Leu He Ala Leu Ser Cys He Leu Pro Phe Val Phe Val He lie He 

35 40 45 

Ser Val Thr Asp Glu Lys Ser Leu Leu Gin Tyr Gly Tyr Ser Phe Phe 

50 55 60 

Pro Ser Gin Phe Gly Leu Asp Gly Phe Glu Phe Leu Ala Gin Phe Lys 
65 70 75 80 

Asp Lys He Leu Gin Ala Leu Phe lie Ser Val Phe Val Thr Val Val 

85 90 95 

Gly Thr Leu Thr Asn Val Phe lie Thr Thr Thr Tyr Ala Tyr Ala lie 

100 105 110 

Ser Arg Thr Thr Phe Lys Tyr Arg Arg Phe Phe Thr He Phe Val Leu 

115 120 125 

Leu Ser Met Leu Phe Asn Ala Gly Leu Val Pro Gly Tyr He Met Val 

130 135 140 

Thr Arg Val Leu Gin Leu Gly Asp Thr Val Trp Ala Phe Asp Cys Ser 
145 150 155 160 
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Asn Ala Ser Leu Thr lie Gin Tyr His Leu Asp Ala Phe Phe Phe Gin 

165 170 175 

Glu Asp His Ser Arg Ser His Ser Arg lie Gly Ser Tyr Arg Trp Cys 
180 185 ago 

Gin 



(2) INFORMATION FOR SEQ ID NO: 431: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 97 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: Protein 

(3Ci) SEQUENCE DESCRIPTION: SEQ ID NO:431: 

Met Lys Thr Arg Lys lie Pro Leu Arg Lys Ser Val Val Ser Asn Glu 

15 10 15 

Val lie Asp Lys Arg Asp Leu Leu Arg He Val Lys Asn Lys Glu Gly 

20 25 30 

Gin Val Phe lie Asp Pro Thr Gly Lys Ala Asn Gly Arg Gly Ala Tyr 

35 40 45 

He Lys Leu Asp Asn Ala Glu Ala Leu Glu Ala Lys Lys Lys Lys Val 

50 55 60 

Phe Asn Arg Ser Phe Ser Met Glu Val Glu Glu Ser Phe Tyr Asp Glu 

65 70 75 80 

Leu lie Ala Tyr Val Asp His Lys Val Lys Arg Arg Glu Leu Gly Leu 
85 90 95 

Glu 



(2) INFORMATION FOR SEQ ID NO: 432: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 169 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
ID) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:432: 

Met He Arg Lys Val Glu Met Ala Asp Val Glu Val Leu Ala Lys He 

15 10 15 

Ala Lys Gin Thr Phe Arg Glu Thr Phe Ala Tyr Asp Asn Thr Glu Glu 

20 25 30 

Gin Leu Gin Glu Tyr Phe Glu Glu Ala Tyr Ser Leu Lys Thr Leu Ser 

35 40 45 

Thr Glu Leu Gly Asn Pro Asp Ser Glu Thr Tyr Phe He Met Gin Glu 

50 55 60 

Glu Glu He Ala Ala Phe Leu Lys Val Asn Trp Gly Ser Ala Gin Thr 
65 70 75 8 o 

Glu Arg Glu Leu Glu Asp Ala Phe Glu He Gin Arg Leu Tyr Val Leu 

85 90 95 

Gin Lys Phe Gin Gly Phe Gly Leu Gly Lys Gin Leu Phe Glu Phe Ala 

100 105 no 

Leu Glu Leu Ala Thr Lys Asn Ser Phe Ser Trp Ala Trp Leu Gly Val 

115 120 125 

Trp Glu His Asn Thr Lys Ala Gin Ala Phe Tyr Asn Arg Tyr Gly Phe 

130 135 140 

Glu Lys Phe Ser Gin His His Phe Met Val Gly Gin Lys Val Asp Thr 
145 150 155 leo 

Asp Trp Leu Leu Arg Lys Lys Leu Arg 
165 

(2) INFORMATION FOR SEQ ID NO: 433: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 154 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOEQLOGY.: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 433: 

Met Lys Ser He He Asp Val Lys Asn Leu Ser Phe Arg Tyr Lys Glu 

15 io 15 

Asn Gin Asn Tyr Tyr Asp Val Lys Asp He Thr Phe His Val Lys Arg 
20 25 30 
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Gly Glu Trp Leu Ser lie Val Gly His Asn Gly Ser Gly Lys Ser Thr 
35 40 45 

Thr Val Arg Leu lie Asp Gly Leu Leu Glu Ala Glu Ser Gly Glu He 
50 55 60 



Val He Asp Gly Gin Arg Le U Thr Glu Glu Asn Val Trp Asn He Arg 
65 70 75 80 

Arg Gin lie Gly Met Val Phe Gin Asn Pro Asp Asn Gin Phe Val Gly 

85 90 95 

Ala Thr Val Glu Asp Asp Val Ala Phe Gly Leu Glu Asn Gin Gly Leu 



100 105 



110 

Ser Arg Gin Glu Met Lys Lys Arg Val Glu Glu Ala Leu Ala Leu Val 

115 120 125 

Gly Met Leu Asp Phe Lys Lys Arg Glu Pro Ala Arg Pro He Arg Cys 

130 135 140 

Gin Lys His Val Trp Pro Leu Gin Val Leu 
145 ISO 

(2) INFORMATION FOR SEQ ID NO: 434 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 182 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii> MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 434: 

Met Gin Val Ala Phe Thr Gly Glu Thr Ser Pro Gin Val Leu Lys Glu 

He Gly Thr Asp Tyr Val Val He Gly His Ser Glu Arg Arg Asp Tyr 

20 25 30 

Phe His Asp Thr Asp Glu Asp Val Asn Lys Lys Ala Lys Ala He Phe 

35 40 45 

Ala Asn Gly Met Leu Pro He He Cys Cys Gly Glu Ser Leu Glu Thr 

50 55 60 

Tyr Glu Ala Gly Lys Ala Ala Glu Phe Val Gly Ala Gin Val Ser Ala 
" 70 75 80 

Ala Leu Ala Gly Leu Thr Ala Glu Gin Val Ala Ala Ser Val He Ala 

85 90 95 

Tyr Glu Pro He Trp Ala He Gly Thr Gly Lys Ser Ala Ser Gin Asp 
100 105 no 
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Asp Ala Gin Lys Met Cys Lys Val 
115 120 
Phe Gly Gin Glu Val Ala Asp Lys 

130 135 
Val Lys Pro Glu Asn Val Ala Ser 
145 150 
Gly Ala Leu Val Gly Gly Ala Ser 
165 

Leu Leu Asp Phe Val Lys 
180 



Val Arg Asp Val Val Ala Ala Asp 
125 

Val Arg Val Gin Tyr Gly Gly Ser 
140 

Tyr Met Ala Cys Pro Asp Val Asp 
155 160 
Leu Glu Ala Glu Ser Phe Leu Ala 
170 175 



(2) INFORMATION FOR SEQ ID NO: 435: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 168 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43 5: 

Met Gly Lys Lys Arg Trp Ala Arg Asn Gly Phe Glu Ser Asn Asp Ala 

1 5 io 15 

Ser Tyr Ala Gin Val Val Ser Leu Tyr Asp Asp Thr Ser He Ser Val 

20 25 30 

Ser Asn Asn Glu Thr Asp Lys Val Leu Ala Gly Ser Leu Tyr Thr Asp 

3 5 40 45 

Thr Asn Glu Gin Gly Leu Thr He Pro Ser Ser Leu Leu Lys Asn Trp 

50 ' 55 go 

Asn Glu Gin Thr Gly Lys Asn Leu Thr Ala Asn Asp Leu He Gly Lys 
65 70 75 so 

Ser Val Ser Val Ser He Val Glu Ser Ala Ala Glu Thr Ser Lys He 

85 90 95 

Ala Gin Phe Gin Thr Lys He Val Arg Val lie Asn Asp Glu Asp Asp 

100 105 no 

Met Glu Asp Ser Asn Ser Phe Met Leu Ser His Gin Met Glu Thr He 

115 120 125 

Leu Lys Glu Ala Gly Phe Thr Lys Ala Val Ser Tyr Phe He Leu Glu 

130 135 140 

Leu Lys Asp Pro Ser Gin Thr Lys Val Val Thr Glu Glu Leu Gin Lys 

145 150 - cc 

A0U 155 160 
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Asn Lys Lys Tyr Thr Val Leu Ser 
165 

(2) INFORMATION FOR SEQ ID NO: 436: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 141 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43 6: 

Met Met His Thr Tyr Leu Gin Lys Lys lie Glu Asn lie Lys Thr Thr 

heu Gly Glu Met Ser Gly Gly Tyr Arg Arg Met Val Ala Ala Met Ala 

20 25 3 0 

Asp Leu Gly Phe Ser Gly Thr Met Lys Ala He Trp Asp Asp Leu Phe 

35 40 45 

Ala His Arg Ser Phe Ala Gin Trp lie Tyr Leu Leu Val Ser Gly Ser 

50 55 60 

Phe Pro Leu Trp Leu Glu Leu Val Tyr Glu His Arg lie val Asp Trp 
65 70 75 eo 

He Gly Met He Cys Ser Leu Thr Gly lie lie Cys Val lie Phe Val 

« 90 95 

Ser Glu Gly Arg Ala Ser Asn Tyr Leu Phe Gly Leu lie Asn Ser Val 

100 105 no 

He Tyr Leu He Leu Ala Leu Gin Lys Gly Phe Tyr Gly Glu Val Leu 

H5 120 125 

Thr Thr Leu Tyr Phe Thr Val Met Gin Pro lie Gly Leu 
130 135 140 

(2) INFORMATION FOR SEQ ID NO: 437: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 70 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 437: 

Met Met Arg His Lys Phe His Gly Ser Gly Leu Val Gin Lys Lys Thr 

15 10 15 

Gly Ser lie Pro Leu Glu He Glu Leu Thr Gin Lys Gly He Val Thr 

20 25 30 

Lys Val Asn His Leu Lys Gin Ala Arg Leu Ser Gly Tyr Val Gly His 

35 40 45 

Val Gin Val Val Ser Leu Ala Pro Glu Asp Leu Gin He He Lys Gly 

50 55 60 

Ala Pro Trp He Arg Arg 
65 70 

(2) INFORMATION FOR SEQ ID NO: 438: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 237 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: Protein 

{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 438: 

Met Arg Gly He His Phe Val Gin He Pro Thr Ser Leu Thr Ala Gin 

1 5 10 is 

Val Asp Ser Ser He Gly Gly Lys Thr Gly Val Asn Thr Pro Phe Ala 

20 25 30 

Lys Asn Met Val Gly Thr Phe Ala Gin Pro Asp Gly Val Leu He Asp 

35 40 45 

Pro Leu Val Leu Glu .Thr Leu Gly Lys Arg Glu Leu He Glu Gly Met 

50 55 60 

Gly Glu Val He Lys Tyr Gly Leu He Glu Asp Pro Glu Leu Trp Ala 
65 70 75 80 

Leu Leu Thr Gly Leu Asn Gly Ser Val Glu Ser He Leu Glu His Ala 

85 90 95 

Glu Thr Leu He Glu His Ser Cys Gin Val Lys Arg Lys Met Val Val 

100 105 HO 

Glu Asp Glu Leu Asp Asn Gly He Arg Leu Tyr Leu Asn Phe Gly His 
115 120 125 

397 



WO 97/43303 



PCT/US97/07950 



Thr He Gly His Ala lie Glu Ala Thr Ala Gly Tyr Gly Lys Val Met 
130 135 140 

His Gly Glu Ala Val Ala Met Gly Met Val Gin He Ser Lys lie Ala 

14 5 150 -ice 

155 160 
Glu Glu Lys Gly Leu Met Pro Ala Gly lie Thr Gin Ser lie Thr Glu 

165 170 175 

Met Cys Gin Lys Phe Gly Leu Pro Val Asp Tyr Glu Asn Trp Glu Val 

180 "5 190 

Asp Lys Leu Tyr Gin Ala Leu Thr His Asp Lys Lys Ala Arg Gly Asn 

195 200 205 

Thr Leu Lys Leu Val Leu Val Pro Glu Leu Gly Ser Ala Thr He His 

210 215 220 

Pro Val Ser Leu Glu Glu Met Lys Asp Tyr Leu Val Lys 
225 230 235 

<2> INFORMATION FOR SEQ ID NO: 43 9: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 68 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



Gly 



(ii) MOLECOXE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 439 : 

Met Gly Lys Glu Lys Val Trp Leu Pro Arg Lys Gin Gly Leu Pro 

Leu Ala Tyr Tyr Val lie Glu Val Ala His Lys Glu Glu Leu Leu Thr 

20 25 30 

He Ala Gin Arg Ala Gin Glu Val Asp Val Pro He Lys Trp Met Thr 

35 40- 45 

Ser Ser Gin Leu Glu He Thr Asp Ser Asp Gly He Val Thr Cys He 

50 55 60 

Arg Leu Ala Arg 
65 



(2) INFORMATION FOR SEQ ID NO: 440: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 97 amino acids 

(B) TYPE: amino acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(ii> MOLECULE TYPE: Protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 440: 

Met Cys He Thr Pro Val Ala Ser Leu Ala Pro Asp Val Phe Cys Val 

Ser Met Asn Gly Leu Ser Lys Ser His Arg He Ala Gly Phe Arg Val 

20 25 30 

Gly Trp Met Val Leu Ser Gly Pro Lys Thr His Val Lys Gly Tyr He 

35 40 45 

Glu Gly Leu Asn Met Leu Ser Asn Met Arg Leu Cys Ser Asn Val Leu 

50 55 60 

Ala Gin Gin Val Val Gin Thr Ser Leu Gly Gly His Gin Ser Val Asp 
65 70 75 80 

Glu Leu Leu Leu Ser Trp Trp Thr Asn Leu Arg Ala Lys Lys Phe His 
85 90 95 

Leu 



(2) INFORMATION FOR SEQ ID NO: 441: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 143 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 441: 

Met Arg Leu Glu Gin Asp Cys Pro Val Phe Leu Lys He Lys Glu Lys 

15 10 is 

Asp Met Ala Ser Lys Met Leu His Thr Cys Leu Arg Val Glu Asn Leu 

20 25 30 

Glu Lys Ser He Ala Phe Tyr Gin Asp Ala Phe Gly Phe Lys Glu Leu 

35 40 45 

Arg Arg Arg Asp Phe Pro Asp His Ala Phe Thr He Val Tyr Leu Gly 
50 55 60 
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Leu Glu Gly Asp Asp Tyr Glu Leu Glu Leu Thr Tyr Asn Tyr Asp His 
65 7 ° 75 80 

Gly Pro Tyr Val Val Gly Asp Gly Phe Ala His He Ala Leu Ser Thr 

85 90 95 

Pro Asp Leu Glu Ala Leu His Gin Glu His Ser Thr Lys Gly Tyr Glu 

100 105 no 

Val Thr Glu Pro Asn Gly Leu Pro Gly Thr Ala Pro Asn Tyr Tyr Phe 

115 120 125 

Val Lys Asp Pro Asp Gly Tyr Lys Val Glu Val He Arg Glu Lys 
130 135 140 

(2) INFORMATION FOR SEQ ID NO: 442 : 

<i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 442: 

Met Leu Asp Gly Met Asp He Ser Asn Thr Tyr Gin Asp Lys Leu Glu 

1 5 io 15 

Glu Leu Glu Lys Glu Ala Lys Thr Val Val Phe Leu Ser Cys 
20 25 30 

(2) INFORMATION FOR SEQ ID NO: 443: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 72 amino acids 
<B) TXPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 443 : 

Met Ala Thr Phe Tyr Val Pro Ala Val Asn Leu He Gly Lys Gly Val 
1 5 10 15 
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Val Asn Glu Val Gly Pro Tyr lie Lys Glu Leu Gly Tyr Lys Lys Ala 
20 25 30 

Leu Leu Val Thr Asp Lys Tyr He Glu Gly Ser Asp He Leu Pro Lys 

35 40 45 

Val Leu Lys Pro Leu Asp Thr Glu Gly He Glu Tyr Val He Phe Ser 

50 55 60 

Asp Val Asp Ala Lys Pro Tyr Leu 
65 70 

(2) INFORMATION FOR SEQ ID NO:444: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 444: 

Met Pro Ser Phe Asp Ala Gin His Pro Glu He Leu Glu Thr He Arg 

1 5 io 15 

Asp Thr Lys Asp Leu Pro Glu Glu Ala Val Leu Asp Ala Ala He Thr 

20 25 30 

Glu Phe Leu Asn Gin Ser Ser Phe Pro He Arg He Glu Val Ser Asp 

35 40 45 

Gly Ser Val Ser Lys 
50 

(2) INFORMATION FOR SEQ ID NO: 445: 

( i ) SEQUENCE -CHARACTERISTICS : 
(A) LENGTH: 188 amino acids 
<B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 445: 
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Met lie Leu Leu lie Asp Asn Tyr Asp Ser Phe Thr Tyr Asn Leu Ala 

Gin Tyr lie Gly Asn Phe Ala Glu val Gin Val Leu Arg Asn Asp Asp 

20 25 30 

Ser Lys Leu Tyr Glu Glu Ala Glu Lys Ala Asp Gly Leu Val Phe Ser 

35 40 45 

Pro Gly Pro Gly Trp Pro Val Asp Ala Gly Lys Met Glu Asp Met He 

50 55 go 

Arg Asp Phe Ala Gly Lys Lys Pro He Leu Gly He Cys Leu Gly His 
65 70 75 80 

Gin Ala lie Ala Glu Val Phe Gly Gly Lys Leu Gly Leu Ala Pro Lys 

85 90 95 

Val Met His Gly Lys Gin Ser Asn He Asn Phe Glu Ala Pro Ser Val 

100 105 110 

Leu Tyr Gin Gly He Glu Asp Gly Arg Ala Val Met Arg Tyr His Ser 

US 120 12 5 

He Leu lie Glu Glu Met Pro Glu Asp Phe Glu Val Thr Ala Arg Ser 

«0 135 140 

Thr Asp Asp Gin Ala He Met Gly He Gin His Lys Asn Leu Pro He 

I 45 150 ice 

155 i 6 o 
Tyr Gly Phe Gin Tyr His Pro Glu Ser He Gly Thr Pro Asp Gly Leu 

165 170 i7 5 

Ser Ser He Arg Asn Phe He Glu Glu Val Val Lys 
180 18 5 

(2) INFORMATION FOR SEQ ID NO: 446 : 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 136 amino acids 
<B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 46: 

Leu Leu Ala Phe Val Ala Trp Asn Phe Val Thr Phe Phe Leu Cys Pro 

1 5 10 1C 

Pro Thr Arg Asn Glu He Ser Tyr Gin His Ala Ala Pro Thr Phe Thr 

20 25 30 

Gly Val Thr Ala Phe Leu Met Tyr Phe Phe Tyr Pro Val He Ala Gly 
35 40 45 
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Pro He Phe Glu Asp Met He Tyr Arg Gly Leu Val Met Thr Ala Leu 

50 55 60 

Glu Lys Gly Lys Lys Trp Gly Leu Asp Val Leu Gly Ser Ala Val Leu 
65 70 75 80 

Phe Gly Val Ser His He Ser Asn His Gly Trp Val Leu Thr Asp Phe 

85 90 95 

Val Phe Tyr Met Gly Gly Gly Leu He Phe Ala Val Leu Phe Arg Met 

100 105 HO 

Thr Lys Ser He Tyr Trp Pro He Gly Leu His He Val Tyr Asn Gly 

115 120 125 

He Gly Gin Leu Leu Met Leu Leu 
130 135 



(2) INFORMATION FOR SEQ ID NO: 447: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 272 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 447: 



Met Thr Val Asp Arg Glu Gly Phe Glu Ala Ala Met Lys Glu Gin Gin 

15 io is 

Glu Arg Ala Arg Ala Ser Ala Val Lys Gly Gly Ser Met Gly Met Gin 

20 25 30 

Asn Glu Thr Leu Gin Asn He Thr Val Glu Ser Ala Phe Asn Tyr Asn 

35 40 45 

Ala Ser Gin Leu Ser Ser Lys Leu Val Ala lie Val Ala Asp Asn Ala 

50 55 60 

Glu Val Glu Ala Val Ser Glu Gly Thr Ala Ser Leu He Phe Ala Glu 
65 70 75 80 

Thr Ser Phe Tyr Ala Glu Met Gly Gly Gin Val Ala Asp Tyr Cly Gin 

85 90 95 

He Leu Asp Glu Ser Gly Lys Val Val Ala Thr Val Thr Asn Val Gin 

100 105 no 

Lys Ala Pro Asn Gly Gin Ala Leu His Thr Val Glu Val Leu Ala Pro 

115 120 125 

Leu Ala Leu Asn Gin Glu Tyr Thr Leu Ala He Asp Ser Asn Arg Arg 
130 135 140 
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His Arg Val Met Lys Asn His Thr Ala Thr His Leu Leu His Ala Ala 

145 150 ice 

155 160 

Leu His Asn He Leu Gly Asn His Ala Thr Gin Ala Gly Ser Leu Asn 

165 170 175 

Glu Val Glu Phe Leu Arg Phe Asp Phe Thr His Phe Gin Ala Val Thr 

180 185 190 

Ala Glu Glu Leu Arg Ala He Glu Gin Gin Val Asn Glu Lys He Trp 

195 200 205 

Glu Ala Leu Glu Val Lys Thr Val Glu Thr Asp lie Asp Thr Ala Lys 

210 215 220 

Glu Asn Gly Arg Ser Asn Ser Cys Ser Pro Gly Gly Ser Thr Asn Ser 

225 230 -jic 

235 240 

Xaa Xaa Ala Ala Thr Ala Val Glu Leu His Phe Xaa Phe Pro Leu Met 

245 250 255 

Lys Val Asn Cys Xaa Leu Gly Leu Xaa Trp Pro Xaa Arg Phe Pro Phe 
260 265 270 

(2) INFORMATION FOR SEQ ID NO: 448: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 251 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44 8: 



Met Asp Ala Glu Val Ser Lys Asn Leu Arg Leu lie Leu Glu Arg Lys 

1 5 10 15 

Gly Met Thr He Leu Thr Gly Thr Lys Leu Gin Glu He lie Glu Glu 

20 25 30 

Asn Gly Gin Leu Arg He Lys Val Glu Gly Lys Asp Asn He He Ala 

35 40 45 

Ser Lys Ala Leu Leu Ser He Gly Arg Met Pro Asp Leu Glu Gly He 

50 55 60 

Gly Glu Val Glu Phe Glu Leu Asp Arg Gly Cys He Lys Val Asn Glu 
" 70 75 80 

Tyr Met Glu Thr Ser Val Pro Gly He Tyr Ala Pro Gly Asp He Asn 

85 go 95 

Gly Thr Lys Met Leu Ala His Ala Ala Phe Arg Met Gly Glu Val Ser 
100 105 no 
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Ala Glu Asn Ala Leu Lys Gly Asn His Ala Val Ala Lys Leu Asn Leu 

115 120 125 

Thr Pro Ala Ala He Tyr Thr Leu Pro Glu Val Ala Ala Val Gly Leu 

130 135 140 

Thr Glu Glu Gin Ala Arg Glu Lys Tyr Asp Val Ala He Gly Lys Phe 
1^5 150 155 160 

Asn Phe Ala Ala Asn Gly Arg Ala He Ala Ser Asp Ala Ala Gin Gly 

165 170 175 

Phe Val Lys Val He Ala Asp Lys Lys Tyr Gly Glu He Leu Gly Val 

180 185 190 

His He He Val Pro Ala Ala Ala Glu Leu He Asn Glu Ala Ser Ser 

195 200 205 

He He Glu Met Glu He Thr Val Glu Glu Met Leu Lys Thr He His 

210 215 220 

Gly His Pro Thr Tyr Phe Glu Val Met Tyr Glu Ala Phe Ala Asp Val 
225 230 235 240 

Leu Gly Met Ala He His Ser Pro Lys Lys Lys 



(2) INFORMATION FOR SEQ ID NO: 449: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 134 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 449: 

Leu He Val Glu Thr He Leu Gly He Leu Gly Ser Arg Gly He His 

1 .5 io 15 

Leu Ala Val Gin Gly Cys Glu His Val Asn Arg Ala Leu Val Val Glu 

20 25 30 

Arg Gin Val Ala Glu Gin Phe Asp Leu Glu He Val Ser Val His Pre 

35 40 45 

Thr Leu His Ala Gly Gly Ser Gly Gin Leu Ala Ala Phe Lys Phe Met 

50 55 60 

Gin Asp Pro Val Glu Val Glu Phe He Lys Ala His Ala Gly Leu Asp 
65 70 75 80 

He Gly Asp Thr Ala He Gly Met His Val Lys His Val Gin Val Pro 



245 



250 



85 



90 



95 
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lie Arg Pro lie Leu Arg Glu He Gly His Ala His Val Thr Ala Leu 

100 105 110 

Ala Ser Arg Pro Lys Leu lie Gly Gly Ala Arg Ala His Tyr 



"5 120 



Pro Gin 

125 

Asp Ala He Arg Lys Ser 
130 



(2) INFORMATION FOR SEQ ID NO:450: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 81 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 450: 

Met Lys lie Val Ser Gly lie Tyr Gly Gly Arg Pro Leu Lys Thr Leu 
1 5 10 



15 



Glu Gly Lys Thr Thr Arg Pro Thr Ser Asp Lys Val Arg Gly Ala lie 

20 25 30 

Phe Asn Met He Gly Pro Tyr Phe Glu Val Gly Arg Val Leu Asp Leu 

35 « 45 

TVr Ala Gly Ser Gly Gly Leu Ser lie Glu Ala Val Ser Arg Gly Met 

50 55 60 

Ser Ser Ala Val Leu Val Glu Arg Asp Arg Lys Ala Gin Thr He Arg 
65 

Gly 



65 70 73 80 



(2) INFORMATION FOR SEQ ID NO: 451 : 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 163 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 451 : 
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Met Arg Phe Val Lys Glu Phe Gly Ala Val Phe Leu lie Gly lie Gly 

15 10 is 

Gly Glu Leu Pro Asp Gly Lys Pro His Asp Gly Arg Ala Pro Asp Tyr 

20 25 30 

Asp Asp Trp Thr Ser Glu Ser Glu Asn Gly Tyr Lys Gly Leu Asn Gly 

35 40 45 

Asp He Leu Val Trp Asn Glu Ser Leu Gly Gly Ala Phe Glu Leu Ser 

50 55 60 

Ser Met Gly He Arg Val Asp Glu Glu Thr Leu Arg Arg Gin Val Glu 
65 70 75 80 

He Thr Gly Asp Glu Asp Arg Leu Glu Leu Glu Trp His Lys Ser Leu 

85 90 95 

Leu Asn Gly Leu Phe Pro Leu Thr He Gly Gly Gly He Gly Gin Ser 

100 105 HO 

Arg Met Ala Met Phe Leu Leu Arg Lys Arg His He Gly Glu Val Gin 

115 120 125 

Thr Ser Val Trp Pro Gin Val Ser Pro Arg Tyr Leu Arg Lys Tyr Phe 

130 135 140 

Val Glu Asn Arg Thr Ala Arg Phe Gly Phe Leu Ser Leu Phe Val Tyr 
145 150 155 160 

Asn Leu Val 



(2) INFORMATION FOR SEQ ID NO: 452: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 231 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: Protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 452: 

Leu Phe Glu Val Glu Glu Val Ser Asp His Gly Leu Thr Leu Tyr Thr 

15 10 15 

Glu Arg Thr Asp Gly Thr Gly Gly Phe Pro Gly Asn Leu Lys He Trp 

20 25 30 

He Ser Tyr His Leu Glu Glu Thr Gly Ala Ser Met Asn Ser He Tyr 
35 40 45 
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Lys Val Thr Thr Asp Gin Asp Thr Leu Val Asn Pro Thr Asn His Ser 

50 55 60 

Tyr Phe Asn Leu Ser Gly Asp Phe Thr Gin Thr lie Asp Arg His Val 

" 70 7 * 80 

Phe Gin Leu Asn Thr Glu Gly u e Tyr Ser lie Ala Pro Asp Gly Val 

85 90 95 

Pro Ala Lys Thr Pro Glu Ala Asn Arg Asp Val Val Lys His He Tyr 

100 105 110 

Asn Gly Ala Leu Leu Lys Asp He Phe Ala Glu Glu Asp Glu Gin lie 

115 120 12 5 

Gin Leu Ala Ser Gly Leu Asp His Pro Phe Ala Leu Pro Ala Gly His 

"0 135 140 

Asp Asn Ala Gly Phe Leu Tyr Asp Gin Asn Ser Gly Arg Phe Leu Leu 
" 5 150 155 160 

Phe Lys Thr Glu Ala Pro Cys Phe Val Val Tyr Thr Ala Asn Phe Val 

165 170 175 

Asp Glu Ser Val lie lie Gly Gly Gin Pro Met Leu Gin His Asn Gly 

"0 185 190 

lie Ala Leu Glu Ala Gin Ala Leu Pro Asp Ala lie His Ser Asp Leu 

195 200 205 

Lys Gly Gin Val lie Leu Lys Ala Gly Gin Thr Phe Thr Ser Lys Thr 

210 215 220 

Arg Tyr Glu Leu Val Val Lys 
225 230 

(2) INFORMATION FOR SEQ ID NO: 453: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 453: 
Met lie Asn Thr Leu Val Gly Met Arg Phe Asp Leu Lys Thr Arg Lys 



15 



Phe 



1 5 10 

Pro lie Leu Ala Asn Gly Thr Gly Gly Met Ser Gly Pro Ala Val 

20 25 30 

Pro Val Ala Leu Lys Leu lie Arg Gin Val Ala Gin Thr Thr Asp Leu 
35 40 45 
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Pro lie He Gly Met Gly Gly Val Asp Ser Ala Glu Ala Ala Leu Glu 

50 55 60 

Met Tyr Leu Ala Gly Ala Ser Ala He Gly Val Gly Thr Ala Asn Phe 
65 70 75 80 

Thr Asn Pro Tyr Ala Cys Pro Asp He He Glu Asn Leu Pro Lys Val 

85 90 95 

Met Asp Lys Tyr Gly He Ser Ser Leu Glu Glu Leu Arg Gin Glu Val 

100 105 no 

Lys Glu Ser Leu Arg 
115 



(2) INFORMATION FOR SEQ ID NO: 454: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 76 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii> MOLECULE TYPE: Protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 454: 



Met Tyr Asn Lys Val He Met He 

1 5 
Leu His Lys Thr Asn Asn Asp Lys 
20 

Val Asn Arg Arg Tyr Lys Asp Gin 

35 40 
Val Gin Tyr Gly Pro Tyr Gly Ala 

50 55 
Ser Leu Arg Gin Pro Lys Gly Ser 
65 70 



Gly Arg Leu Thr Ser Thr Pro Glu 

10 15 
Ser Val Ala Arg Ala Thr He Ala 
25 30 
Asn Gly Glu Arg Glu Ala Asp Phe 
45 

Arg Thr Ser Gin Lys Thr Leu Ala 
60 

Leu His Phe Arg 
75 



(2) INFORMATION FOR SEQ ID NO:455: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 132 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 
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<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 455: 

Met Val Lys Ser Ser He Gin Gly Pro Arg Leu Ser Lys He Trp Gly 
5 

Ser Thr Arg Gin Glu Ser Tyr Arg Leu Cys Leu Gly Ala Arg Leu Val 

20 25 30 

Tyr Ser Ala Thr Lys Ala Ala Val Lys Thr Phe Ser Asp Gly Leu Arg 

35 40 45 

He Asp Thr lie Ala Thr Asp lie Lys Val Thr Thr lie Gin Pro Gly 

50 55 60 

lie Val Glu Thr Asp Phe Ser Thr Val Arg Phe His Gly Asp Lys Glu 
" 70 75 80 

Arg Ala Ala Ser Val Tyr Gin Gly lie Glu Ala Leu Gin Ala Gin 



85 9 0 



Asp 



He Ala Asp Thr Val Val Tyr Val Thr Ser Gin Pro Arg Arg Val Gin 

100 105 110 

lie Thr Asp Met Thr lie Met Ala Asn Gin Gin Ala Thr Gly Phe Met 

115 120 12 5 

He His Lys Lys 
130 



(2) INFORMATION FOR SEQ ID NO: 456: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 179 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 456: 

Met Lys Ser Glu Leu Arg Lys Gin Val Leu His Glu Met Lys Ala Leu 

Ser Gin Glu Gin Lys Gin Ala lie Asp Gin Ala Leu Thr Glu Arg He 

20 25 30 

Leu Gin His Pro Phe Tyr Gin Glu Ala Lys Val He Ala Thr Tyr Leu 

35 40 45 

Ser Phe Ser His Glu Phe Gin Thr Arg Glu Leu lie Glu Gin Ala Leu 
50 55 60 
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Lys Asp Gly Lys Arg Val Leu Thr Pro Lys Thr Tyr Pro Lys Gly Arg 
65 70 75 80 

Met Asp Phe Val Val Tyr Asp Pro Gin Gin Leu Val Lys Thr Ser Phe 

85 90 95 

Gly Leu Leu Glu Pro Gin Gly Asp Leu Glu Val Val Asp Val Ser Lys 

100 105 HO 

lie Asp Leu He His Val Pro Gly Leu Ala Phe Thr Thr Lys Gly Tyr 

115 120 125 

Arg He Gly Tyr Gly Gly Gly Tyr Tyr Asp Arg Tyr Leu Glu His Phe 

130 135 140 

Ser Gly Arg Ala Leu Ser Thr He Tyr Pro Cys Gin He Gin Asp Phe 
1^5 150 155 160 

He Pro Glu Asn His Asp lie Pro Val Gin Glu Val Leu He Asp Glu 
165 170 175 

Gly Asn Leu 



(2) INFORMATION FOR SEQ ID NO: 457: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 169 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 457 : 

Met Asp Ser Leu Thr Trp Leu Asn Asp Leu He Tyr Pro Met Met Val 

15 io is 

Val Asn Gin Thr He Pro Thr He Ala He Ala Pro He Leu Val Leu 

20 25 30 

Trp Leu Gly Tyr Gly He Leu Pro Lys He Val Leu He He Leu Thr 

35 40 45 

Thr Thr Phe Pro He He Val Ser He Leu Asp Gly Phe Arg His Cys 

50 55 . 60 

Asp Lys Asp Met Leu Thr Leu Phe Ser Leu Met Arg Ala Lys Pro Trp 
65 70 75 80 

Gin He Leu Trp His Phe Lys He Pro Val Ser Leu Pro Tyr Phe Tyr 

85 90 95 

Ala Gly Leu Arg Val Ser Val Ser Tyr Ala Phe He Thr Thr Val Val 
100 105 no 
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Ser Glu Trp Leu Gly Gly Phe Glu Gly Leu Gly Val Tyr Met lie Gin 

115 120 125 

Ser Lys Lys Leu Phe Gin Tyr Asp Thr Met Phe Ala lie lie He Leu 

130 135 140 

Val Ser He lie Ser Leu Leu Gly Met Lys Leu Val Asp lie Ser Glu 
145 150 
Lys Tyr Val He Lys Trp Lys Arg Ser 
165 



155 



160 



(2) INFORMATION FOR SEQ ID NO: 458 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 71 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 458: 

Met Asn Val Gly Tyr Glu Lys Ala Phe Leu Met Gin Thr Ser Leu Asn 

15 10 15 

Leu Pro Thr Ser Glu lie lie Ser Thr Tyr Val Tyr Lys Val Gly Leu 

20 25 30 

val Ser Gly Asp Tyr Ser Tyr Ser Thr Ala Val Gly Leu Phe Asn Ala 

35 40 45 

Val He Asn Val Val Leu Leu Val Ala Val Asn Gin He Val Lys Arg 

50 55 eo 

Met Asn Asn Gly Glu Gly He 
« 70 



(2) INFORMATION FOR SEQ ID NO: 459: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 81 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:459: 
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Met Lys Arg Met Leu Ser Gly Leu Asn Val Val Phe Ser Trp Met Phe 

15 10 is 

Thr lie Lys Lys Asn Asp Gly Lys Thr Thr lie Gly Thr Ala Lys Asp 

20 25 30 

His Pro Thr Trp Val Asn Arg Thr Pro Lys Gly Thr Phe Ser Pro Glu 

35 40 45 

Gly Tyr Pro Leu Tyr His Tyr Gin Thr Tyr He Leu Glu Asp Phe He 

50 55 60 

Glu Asp Gly Ser His Arg Asp Gin Leu Asp Glu Ala Thr Thr Gly Thr 
65 70 75 

Asn 



80 



(2) INFORMATION FOR SEQ ID NO: 460: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 75 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Protein 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 460 : 

Met Ala Leu Val Tyr Asp Ala Lys Val Ala Glu Ala Lys Arg Leu Ser 

1 5 io is 

Glu Ala Leu Leu Met Leu Asp Thr Ser Arg He Val Ala Arg Gly Tyr 

2 0 25 30 

Ala He Val Lys Lys Glu Glu Ser Val Val Asp Ser Val Glu Ser Leu 

35 40 45 

Lys Lys Lys Asp Gin Val Thr Leu Leu Met Arg Asp Gly Gin Val Glu 

50 55 60 

Leu Glu Val Lys Asp Val Lys Thr Lys Glu He 
65 70 75 



(2) INFORMATION FOR SEQ ID NO: 461: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 109 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO.-461: 

Met Ser Gin Ala Gin Tyr Ala Gly Thr Gly Arg Arg Lys Asn Ala Val 

1 5 10 „ 

Ala Arg Val Arg Leu Val Pro Gly Thr Gly Lys lie Thr Val Asn Lys 

20 25 30 

Lys Asp Val Glu Glu Tyr He Pro His Ala Asp Leu Arg Leu Val He 

35 40 45 

Asn Gin Pro Phe Ala Val Thr Ser Thr Val Gly Ser Tyr Asp Val Phe 

50 55 60 

Val Asn Val Val Gly Gly Gly Tyr Ala Gly Gin Ser Gly Ala lie Arg 
65 70 75 80 

His Gly He Ala Arg Ala Leu Leu Gin Val Asn Pro Asp Phe Arg Asp 

85 90 95 

Ser Leu Lys Arg Ala Gly Leu Leu Thr Arg Asp Ser Arg 
100 105 

(2) INFORMATION FOR SEQ ID NO: 462: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 165 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 462: 

Met Ala Gly Ala Ser Leu Met Phe Phe Gly Phe Leu Gly Phe Glu Ser 

He Ser Asn Gly Gly Ser Val Asp Glu Val Lys Thr Pro Gin Lys Asn 

20 25 30 

lie Pro Arg Gly lie Val Leu Ser Leu Ser lie Val Thr lie Leu Tyr 

35 40 45 

Ala Leu Val Thr Leu Val Leu Thr Gly Val Val His Tyr Ser His Leu 

• 50 55 60 

Asn Val Asp Asp Ala Val Ala Phe Ala Leu Arg Ser Val Gly lie Ser 

" 70 ^ 80 
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Trp Ala Ala Asn Tyr Val Ser Leu Val Ala He Leu Thr Leu He Thr 

85 90 95 

Val Cys He Ser Met Thr Tyr Ala Leu Ser Arg Met He Tyr Ser Leu 

100 105 no 

Ala Ser Asp Gly Leu Val Pro Ala Ala Phe Lys Glu Leu Thr Lys Thr 

115 120 125 

Ser Lys He Pro Lys Asn Ala Thr He Leu Thr Gly Leu Ala Ser Ala 

130 135 140 

Val Ala Ala Gly Met Phe Pro Leu Ala Ser He Ala Ala Phe Leu Asn 



160 



145 150 155 

He Cys Thr Leu Ala 
165 

(2) INFORMATION FOR SEQ ID NO: 463: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 126 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 463: 

Met Asn Lys Lys Glu Arg Leu Glu Lys He Arg Arg Leu Val Thr Asp 

1 5 io i 5 

Tyr Gin He Gly Thr Gin Glu Glu He Val Glu His Leu Lys Glu Ala 

20 25 3 0 

Gly He Thr Ala Thr Gin Ala Thr Val Ser Arg Asp He Lys Glu Leu 

35 40 45 

Gly He Val Lys He Pro Leu Arg Asp Asn Thr Tyr Val Tyr Glu Leu 

50 55 6 o 

Pro Lys Ser He Val Lys Ser Leu Gin Leu Ala Glu Asp Asn He Glu 
65 7 ° 75 80 

Ser Ala Glu Leu Met Asp Lys Met He Asn Leu Gin Val He Pro Gly 

05 90 95 

Asn Thr Ala Phe Val Lys Ala Gin Leu He Glu Thr Phe Ala Asp Lys 

100 105 no 

He Phe Ser Cys Leu Thr Asp Asp Ser Gly Pro Val Xaa Gly 
115 120 125 



(2) INFORMATION FOR SEQ ID NO: 464 : 
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(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 111 amino acids 
<B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 464: 

Met Asn Thr Asn Leu Ala Ser Phe lie Val Gly Leu He lie Asp Glu 

Asn Asp Arg Phe Tyr Phe Val Gin Lys Asp Gly Gin Thr Tyr Ala Leu 

20 25 30 

Ala Lys Glu Glu Gly Gin His Thr Val Gly Asp Thr Val Lys Gly Phe 

35 40 45 

Ala Tyr Thr Asp Met Lys Gin Lys Leu Arg Leu Thr Thr Leu Glu Val 

50 55 go 

Thr Ala Thr Gin Asp Gin Phe Gly Trp Gly Arg Val Thr Glu Val Arg 
" 70 75 80 

Lys Asp Leu Gly Val Phe Val Asp Thr Gly Leu Pro Asp Lys Glu lie 

85 90 95 

Val Val Val Thr Arg Tyr Ser Pro Cys Ala Gin Gly Thr Leu Ala 
100 105 110 

(2) INFORMATION FOR SEQ ID NO: 465: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 255 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 465: 

Met Tyr Leu Lys Glu lie Glu lie Gin Gly Phe Lys Ser Phe Ala Asp 

1 5 10 15 

Lys Thr Lys Val Val Phe Asp Gin Gly Val Thr Ala Val Val Gly Pro 
20 25 30 
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Asn Gly Ser Gly Lys Ser Asn lie Thr Glu Ser Leu Arg Trp Ala Leu 

35 40 . 45 

Gly Glu Ser Ser Val Lys Ser Leu Arg Gly Gly Lys Met Pro Asp Val 

50 55 60 

He Phe Ala Gly Thr Glu Ser Arg Lys Pro Leu Asn Tyr Ala Ser Val 
65 70 75 80 

Val Val Thr Leu Asp Asn His Asp Gly Phe He Lys Asp Ala Gly Gin 

85 90 95 

Glu He Arg Val Glu Arg His He Tyr Arg Ser Gly Asp Ser Glu Tyr 

100 105 no 

Lys He Asp Gly Lys Lys Val Arg Leu Arg Asp He His Asp Leu Phe 

H5 120 125 

Leu Asp Thr Gly Leu Gly Arg Asp Ser Phe Ser He He Ser Gin Gly 

130 135 140 

Lys Val Glu Glu He Phe Asn Ser Lys Pro Glu Glu Arg Arg Ala He 
1« 150 155 160 

Phe Glu Glu Ala Ala Gly Val Leu Lys Tyr Lys Thr Arg Arg Lys Glu 

165 170 175 

Thr Glu Ser Lys Leu Gin Gin Thr Gin Asp Asn Leu Asp Arg Leu Glu 

180 185 190 

Asp He He Tyr Glu Leu Asp Asn Gin He Lys Pro Leu Glu Lys Gin 

195 200 205 

Ala Glu Asn Ala Arg Lys Phe Leu Asp Leu Glu Gly Gin Arg Lys Ala 

210 215 220 

He Tyr Leu Asp Val Leu Val Ala Gin He Lys Glu Asn Lys Ala Glu 
225 230 235 240 

Leu Glu Ser Thr Glu Glu Glu Ser Trp Leu Arg Phe Lys Asn Ser 



(2) INFORMATION FOR SEQ ID NO: 466: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH :. 26 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 466 : 



245 



250 



255 



Met Asp Asp Tyr Asn Ala Thr Gin Ala Ser Thr He Ala Val Met Pro 
15 10 15 
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Val Arg Tyr Glu Glu Val Leu Leu Met Val 
20 25 

(2) INFORMATION FOR SEQ ID NO: 467 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 101 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 467 : 

Met Arg Ser Leu Asn Phe Met Arg Ala Leu Leu Thr Asp Lys Gin Met 

Asn Tyr lie Glu Leu Tyr Tyr Ala Asp Asp Tyr Ser Leu Ala Glu lie 

20 25 3Q 

Ala Glu Glu Phe Gly Val Ser Arg Gin Ala Val Tyr Asp Asn lie Lys 

35 « 45 

Arg Thr Glu Lys lie Leu Glu Asp Tyr Glu Met Lys Leu His Met Tyr 

Ser Asp Tyr lie Val Arg Ser Gin He Phe Asp Gin lie Leu Asp Arg 



65 70 



Tyr Pro Lys Asp Asp Phe Leu Gin Glu Gin He Glu lie Leu Thr Ser 

85 90 
He Asp Asn Arg Glu 
100 



80 

95 



(2) INFORMATION FOR SEQ ID NO: 468 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 140 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 468: 
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Met Gly Trp Glu He Gly He Thr Pro Ser Thr Val Leu Val Ser His 
15 10 15 

Leu Leu Val Gin Glu Ser Thr Ser Pro Asp Leu Leu Val Asn Glu Phe 

20 25 30 

Leu Leu Phe Ala He Gly Thr Gly Phe Ala Leu Leu Val Asn Leu Tyr 

35 40 45 

Met Pro Ser Arg Glu Glu Glu He Gin His Tyr His Thr Leu Val Glu 

50 55 60 

Glu Lys Leu Lys Asp He Leu Gin Arg Phe Lys Tyr Tyr Leu Ser Arg 
65 70 75 80 

Gly Asp Gly Arg Asn Arg Ala Gin Leu Val Ala Glu Leu Asp Thr Leu 

85 90 95 

Leu Lys Glu Ala Leu Arg Leu Val Tyr Leu Asp His Ser Asp His Leu 

100 105 no 

Phe His Gin Thr Asp Tyr His He His Tyr Phe Glu Met Arg Gin Arg 

115 120 125 

Gin Ser Arg He Pro Glu Lys His Gly Pro Thr Asp 
130 135 140 

£2) INFORMATION FOR SEQ ID NO: 469: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 469: 

Met Glu Leu Leu Ala Asn Ala Asn Lys He Met Glu Gly Lys Val Val 

1 5 io „ 

Leu Asn Trp Asp He Gin Tyr Leu Gly Trp Gly Asp Tyr Gly Lys Glu 

20 25 30 

Asn Val Ser Tyr Gin His His Leu Val Lys Thr Met He Ser Pro Leu 

35 40 45 

Gin He Thr He Leu 
50 

(2) INFORMATION FOR SEQ ID NO: 470: 



(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 86 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

<ii> MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 470: 

Met Glu Arg Asn Asn Arg Lys Val Leu Val Gly Arg Val Val Ser Asp 

10 15 
Lys Met Asp Lys Thr lie Thr Val Val Val Glu Thr Lys Arg Asn His 

20 « 30 

Pro val Tyr Gly Lys Arg lie Asn Tyr Ser Lys Lys Tyr Lys Ala His 

35 40 45 

Asp Glu Asn Asn Val Ala Lys Glu Gly Asp He Val Arg He Met Glu 

50 55 60 

Thr Arg Pro Leu Ser Ala Thr Lys Arg Phe Arg Leu Val Glu Val Val 
65 70 75 

Glu Glu Ala Val He He 
85 



80 



(2) INFORMATION FOR SEQ ID NO: 4 71: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 86 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 471: 

Met His Arg Lys Pro Arg Gly Val Asp Phe Gly Glu Gly Ser He Val 

J 5 « 15 

Gin Lys Val Lys Ala Met He Pro lie Leu He Leu Leu Phe Ala Thr 

20 25 30 

Ser Leu Lys Arg Ala Asp Ser Leu Ala He Ala Met Glu Ala Arg Gly 

35 40 45 

T*r Gin Gly Gly Lys Gly Arg Ser Gin Tyr Arg Gin Leu Lys Trp Thr 
50 55 60 
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Leu Lys Asp Thr Leu Thr lie Leu Val lie Leu Val Leu Gly Cys Cys 
65 70 75 80 

Leu Phe Phe Leu Lys Ser 
85 

(2) INFORMATION FOR SEQ ID NO: 472: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 133 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 472: 

Met Asn Phe Thr Leu lie Asn Trp Arg lie Arg Met Gin Tyr Leu Glu 

15 10 15 

Lys Lys Glu lie Lys Glu lie Gin Leu Ala Leu Leu Asp Tyr He Asp 

20 25 30 

Glu Thr Cys Lys Lys His Asp He Pro Tyr Phe Leu Ser Tyr Gly Thr 

35 40 45 

Met Leu Gly Ala He Arg His Lys Gly Met He Pro Trp Asp Asp Asp 

50 55 60 

He Asp He Ser Leu Tyr Arg Glu Asp Tyr Glu Arg Leu Leu Lys He 
65 70 75 80 

He Glu Glu Glu Asn His Pro Arg Tyr Lys Val Leu Ser Tyr Asp Thr 

85 90 95 

Ser Ser Trp Tyr Phe His Asn Phe Ala Ser He Leu Asp Thr Ser Thr 

100 105 110 

Val He Glu Glu His Val Lys Tyr Lys Arg His Asp Thr Ser Leu Phe 

115 120 125 

He Asp Val Ser His 
130 

(2) INFORMATION FOR SEQ ID NO: 47 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 167 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO:473: 

Leu Arg Val Ala Tyr Phe Lys Val His His Pro He Tyr Tyr Tyr Cys 

Ala Tyr Phe Ser He Arg Ala Lys Ala Phe Asp lie Lys Thr £ 01y 

20 25 
Ala Gly Leu Glu Val He Lys Arg Arg Met Glu Glu He H Glu Lys 

" 40 
Arg Lys Asn Asn Glu Ala Ser Asn Val Glu He Asp Leu Tyr Thr Thr 

50 55 60 

Leu Glu He Val Asn Glu Met Trp Glu Arg Gly Phe Lys Phe Gly Lys 

70 75 80 

Leu Asp Leu Tyr Cys Ser Gl n Thr Thr Glu Phe Leu He Asp Gly Asp 

85 £Q 

Thr Leu He Pro Pro Phe Val Ala Met Asp Gly Leu Gly Glu JL Val 
100 "5 110 

15 120 125 

Thr Glu Leu Arg Lys Arg Gly Gly Leu Ser Ser Thr Leu Val Glu Lys 



135 



140 



Met Asp Glu Met Gly He Leu Gly Asn Met Pro Glu Asp Asn Gin Leu 



150 



Ser Leu Phe Asp Glu Leu Phe 
165 



155 



160 



(2) INFORMATION FOR SEQ ID NO: 474 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 85 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 474: 
Met Ser Glu Asn Gin Gin Ala Leu Asn His Val Val Ser Met Glu Asp 



10 



15 
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Leu Thr Val Asp Gin Val Met Lys Leu He Lys Arg Gly He Glu Phe 

20 25 30 

Lys Asn Gly Ala Gin He Pro Tyr Glu Asp His Pro He Val Ser Asn 

35 40 45 

Leu Phe Phe Glu Asp Ser Thr Arg Thr His Lys Ser Phe Glu Val Ala 

50 55 60 

Glu He Lys Leu Gly Leu Glu Arg Leu Asp Phe Asp Val Lys Thr Ser 
65 70 75 80 

Ser Val Asn Lys Gly 
85 

(2) INFORMATION FOR SEQ ID NO: 475: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 151 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO:475: 

Met Ser Met Phe Leu Asp Thr Ala Lys He Lys Val Lys Ala Gly Asn 

1 5 10 15 

Gly Gly Asp Gly Met Val Ala Phe Arg Arg Glu Lys Tyr Val Pro Asn 

20 25 30 

Gly Gly Pro Trp Gly Gly Asp Gly Gly Arg Gly Gly Asn Val Val Phe 

35 40 45 

Val Val Asp Glu Gly Leu Arg Thr Leu Met Asp Phe Arg Tyr Asn Arg 

50 55 60 

His Phe Lys Ala Asp Ser Gly Glu Lys Gly Met Thr Lys Gly Met His 
fi 5 70 75 80 

Gly Arg Gly Ala Glu Asp Leu Arg Val Arg Val Pro Gin Gly Thr Thr 

85 90 95 

Val Arg Asp Ala Glu Thr Gly Lys Val Leu Thr Asp Leu He Glu His 

100 105 no 

Gly Gin Glu Phe He Val Ala His Gly Gly Arg Gly Gly Arg Gly Asn 

115 120 125 

He Arg Phe Ala Thr Pro Lys Asn Pro Ala Pro Glu He Ser Glu Asn 

130 135 140 

Gly Glu Pro Gly Ser Gly Thr 
145 150 
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(2) INFORMATION FOR SEQ ID NO: 476: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 81 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 476: 

Met Gly Ser Phe Leu Met Pro Lys Ser Glu Val Tyr Met Lys Gin Glu 

5 10 15 

Ser Thr Val Asp Leu Leu Leu Asn v*i *«*~ ^ * 

^eu i,eu Asp Val Asp Gin Arg Pro Ser Asp Gly 

^ 25 
Lys Gly He Leu Leu Ser Phe Gin His Val Phe Ala Met Phe Gly Ma 

40 45 
Thr lie Leu Val Pro Leu He Leu Gly Met Pro Val Ser Val Ala Leu 

55 60 
Leu cys Phe Gin Leu Phe Gly Thr Leu He Leu His Asp Cys ,y r Trp 



65 
Phe 



70 



75 



80 



(2) INFORMATION FOR SEQ ID NO: 477; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 172 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:477 : 
Met Ala Asn Tyr Asp lie Phe Thr Ser Pro Pro Thr Asn Ser Gly He 
Ala Pro Val Thr Val lie Val Lys Lys Ser Tyr Gly Phe Vyr rl Glu 



20 



25 



30 
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Ala Lys Thr Phe His Thr Thr Arg Phe Gly Thr He Val Leu His Ser 
35 40 45 

Arg Lys Gin Asn He Pro Asp He He Ala Leu His Thr Ala Pro Pro 

50 55 60 

Leu Pro Gly Leu Met Glu He Trp Lys Gin Asp Leu Asn He He His 
65 70 75 80 

Asn Gin Leu Ala Ser Lys Tyr Pro Lys Ala He He Ala Gly Asp Phe 

85 90 95 

Asn Ala Thr Met Arg His Gly Ala Leu Ala Lys He Ser Ser His Arg 

100 105 110 

Asp Ala Leu Asn Val Leu Pro Pro Phe Glu Arg Gly Thr Trp Asn Ser 

115 120 125 

Gin Ser Pro Lys Leu Phe Asn Ala Thr He Asp His He Leu Leu Pro 

130 135 140 

Lys Asn His Tyr Tyr Val Lys Asp Leu Asp He Val Ser Phe Gin Asn 
145 150 155 160 

Ser Asp His Arg Cys He Phe Thr Glu He Thr Phe 
165 170 



(2) INFORMATION FOR SEQ ID NO: 478 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 111 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 478: 



Met Asn Leu Gly Ala Asp Gly Val He Ser Val Ala Ser His Thr Asn 

1 . 5 10 15 

Gly Asp Glu Met His Glu Met Phe Thr Ala He Ala Glu Ser Asp Met 

20 25 30 

Lys Lys Ala Ala Ala He Gin Arg Lys Phe He Pro Lys Val Asn Ala 

35 40 45 

Leu Phe Ser Tyr Pro Ser Pro Ala Pro Val Lys Ala He Leu Asn Tyr 

50 55 60 

Met Gly Phe Glu Ala Gly Pro Thr Arg Leu Pro Leu Val Pro Ala Pro 
S5 70 75 80 

Glu Glu Asp Val Lys Arg He He Lys Val Val Val Asp Gly Asp Tyr 
85 90 95 
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Glu Ala Thr Lys Ala Thr Val Thr Gly Val Leu Arg P ro Asp Tyr 



100 



105 



110 



(2) INFORMATION FOR SEQ ID NO: 479 : 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 127 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 479 : 

Met Glu Ser Leu lie Gin Thr Tyr Leu Pro Asn Val Tyr Lys Met Gly 

Trp Ser Gly Gin Ala Gly Trp Gly Thr Ala lie Tyr Leu Thr IL Tyr 
20 



25 



Met Thr Val Leu Ser Phe lie lie Gly Gly Phe Leu Gly 



35 



40 



30 

Leu Val Ala 



Gly Leu Phe Leu Val Leu Thr Ala Pro Gly Gly Val Le U G lu Asn Lys 

" 60 
Val val Phe Trp He Leu Asp Lys Ue Thr ser He Phe Arg Ala Val 

70 75 80 

Pro Phe lie lie Leu Leu Ala He Leu Ser Pro Leu Ser His Leu lie 

85 90 95 

Glu Lys Thr Ser lie Gly Pro Asn Ala Ser Pro Cys Pro Thr Phe Phe 

100 105 110 

Cys Ser Leu Cys Leu Leu Cys Pro Ser Gly Ala Gly Cys Leu Gly 



120 



125 



(2) INFORMATION FOR SEQ ID NO:480: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 56 amino acids 

(B) TYPE: amino aci-i 

(C) STRANDEDNESS: f ingle 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 480: 
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Met Asp Ser Gin Asp Asn Lys Arg Lys Trp Lys Asn Arg Asp Leu lie 

15 10 15 

Ser Ser Leu Glu Phe Ala He Thr Gly He Phe Thr Ala He Lys Glu 

20 25 30 

Glu Arg Asn Met Arg Lys His Ala Val Thr Ala Leu Val Val lie Leu 

35 40 45 

Ala Gly Phe Gly Phe Gin Val Ser Arg He Glu Trp Leu Phe Leu Leu 

50 55 60 

Leu Ser He Phe Leu Val Val Ala Phe Glu He lie Asn Ser Ala He 
65 70 75 80 

Glu Asn Val Val Asp Leu Ala Ser His Tyr His Phe Ser His Ala Gly 



(2) INFORMATION FOR .SEQ ID NO: 481: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 307 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

Cii) MOLECULE TYPE: Protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 81: 

Leu Ser Glu He Asp Glu Val He Leu Val Gly Gly Ser Thr Arg He 

1 5 10 15 

Pro Ala Val Val Glu Ala Val Lys Ala Glu Thr Gly Lys Glu Pro Asn 

20 25 30 

Lys Ser Val Asn Pro Asp Glu Val Val Ala Met Gly Ala Ala He Gin 

35 40 45 

Gly Gly Val He Thr Gly Asp Val Lys Asp Val Val Leu Leu Asp Val 

50 55 60 

Thr Pro Leu Ser Leu Gly He Glu Thr Met Gly Gly Val Phe Thr Lys 
6 5 70 75 80 

Leu He Asp Arg Asn Thr Thr He Pro Thr Ser Lys Ser Gin Val Phe 

85 90 95 

Ser Thr Ala Ala Asp Asn Gin Pro Ala Val Asp He His Val Leu Gin 

100 105 no 

Gly Glu Arg Pro Met Ala Ala Asp Asn Lys Thr Leu Gly Arg Phe Gin 



85 



90 



95 



115 



120 



125 
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Leu Thr Asp He Pro Ala Ala Pro A rg G l y Ile Pro Gln Qlu ^ 

130 135 140 

Thr Phe Asp lie Asp Lys Asn Gly n e Val ser Val Lys Ala Lys Asp 

150 155 160 

Leu Gly Thr Gin Lys Glu Gin Thr He Val n e Gin Ser Asn Ser Gly 

" 5 170 175 

Leu Thr Asp Glu Glu lie Asp Arg Met Met Lys Asp Ala Glu Ala Asn 
180 



185 190 



Ala Glu Ala Asp Lys Lys Arg Lys Glu Glu Val Asp Leu Arg Asn Glu 

Val Asp Gin Ala lle^he Ala Thr Glu Lys Thr He Lys Glu Thr Glu 

210 2 " 220 

Gly Lys Gly Pne Asp Ala Glu Arg Asp Ala Ala Gin Ala Ala Leu Asp 



230 



235 



240 



Asp Leu Lys Lys Ala Gin Glu Asp Asn Asn Leu Asp Asp Met Lys Ala 



245 



250 



255 



Lys Leu Glu Ala Leu Asn Glu Lys Ala Gin Gly Leu Ala Val Lys Leu 

260 265 270 

Tyr Glu Gin Ala Ala Ala Ala Gin Gin Ala Gin Glu Gly Ala Glu Gly 
275 



280 



Ala Gin Ala Thr Gly Asn Ala Gly Asp Asp Val Val 



290 
Thr Glu Lys 
305 



295 



285 

Asp Gly Glu Phe 



300 



(2) INFORMATION FOR SEQ ID NO: 482: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 79 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO; 482: 

Met lie Leu Ala Phe Leu Leu Ser Leu Cys Ser Glu Ala Asp Ala Phe 

He Gly Ala Ser Leu Leu Ser Ser Phe Gly Leu Ala Pro Val IL Ala 

20 25 
Phe Leu yal lie Gly Pro Met Leu Asp il e L ys Asn lie Leu Met Met 



35 



40 



45 
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Lys Asn Tyr Leu Lys Ala Arg Phe He Ser His Phe He Thr He Val 

50 55 60 

Thr Leu Val Val Leu Val Tyr Ser Leu Leu He Gly Val He Leu 



(2) INFORMATION FOR SEQ ID NO:483: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 246 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 483: 

Leu Tyr Asn He He Phe Leu Ser Ser Met Trp Glu Pro Tyr Gly Gin 

1 5 10 15 

Leu Ser Asp Leu Pro Val Ala Val Val Asn Asn Asp Lys Glu Ala Ser 

20 25 30 

Tyr Asn Gly Asn Thr Met Ala He Gly Lys Asp Met Val Ser Asn Leu 

35 40 45 

Lys Glu Asn Lys Thr Leu Asp Phe His Phe Val Asp Glu Glu Glu Gly 

50 55 60 

Lys Lys Gly Leu Glu Asp Gly Asp Tyr Tyr Met Val Val Thr Leu Pro 
65 70 75 80 

Ser Asp Leu Ser Glu Lys Thr Thr Thr Leu Ser Asn He Gin Ser Thr 

85 90 95 

Ala Ala Tyr Gin Ser Leu Thr Ser Glu Gin Gin Thr Glu He Ser Asp 

100 105" HO 

Ser Val Ser Gin Asn Ser Thr Asp Ser He Gin Ser Ala Gin Ser He 

115 120 125 

Val Ala Leu Val Gin Asp Leu Gin Gly Ser Leu Glu Asn Leu Gin Asn 

130 135 140 

Gin Ser Ser Asn Leu Ser Thr Leu Lys Asn Gin Ser Asn Gin Val Ser 
145 150 155 160 

Pro He Thr Ser Thr Ser Leu He Gly Leu Ser Ser Gly Leu Thr Glu 

165 170 175 

He Gin Gly Asp Val Thr Ser Lys Leu Val Pro Ala Ser Gin Ser He 

180 185 190 

Ala Ser Gly Val Asn Ala Tyr Thr Thr Gly Val Asp Lys Val Ser Gin 



65 



70 



75 



195 



200 



205 
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Gly Ala Ser Gin Leu Ser Glu Lys Asn Ala Thr Leu Thr Gly Ser Leu 

210 2" 220 

Asp Gin Leu val Ser Gly Ser Asn Thr Leu Thr Gin Lys Ser Ser Are 
225 230 235 



Leu Thr Ala Gly Val Gly 
245 



240 



(2) INFORMATION FOR SEQ ID NO: 484: 

<i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 142 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 84: 

Met Ala Asp Leu Asp His Pro His He Val Arg lie Thr Asp lie Gly 
1 5 10 15 



Glu Glu Asp Gly Gin Gin Tyr Leu Ala Met Glu Tyr Val Ala Gly Leu 

20 25 3 0 

Asp Leu Lys Arg Tyr lie Lys Glu His Tyr Pro Leu Ser Asn Glu Glu 

35 «0 45 

Ala Val Arg He Met Gly Gin lie Leu Leu Ala Met Arg Leu Ala His 

50 55 60 

Thr Arg Gly H e Val His Arg Asp Leu Lys Pro Gin Asn lie Leu Leu 

" 70 75 80 

Thr Pro Asp Gly Thr Ala Lys Val Thr Asp Phe Gly lie Ala Val Ala 

85 90 95 

Phe Ala Glu Thr Ser Leu Thr Gin Thr Asn Ser Asp Val Trp Ala Gin 

100 105 "0 

Phe lie Thr Cys His Gin Ser Arg Arg Val Val Leu Arg Arg Leu Cys 



"5 120 



125 



Arg Val He Ser Met Pro Trp Gly Leu Phe Ser Met Arg Cys 
130 "5 l40 

(2) INFORMATION FOR SEQ ID NO: 485: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 224 amino acids 

(B) TYPE: amino acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4B5: 



Met Leu Ala Thr Ala He Leu Gly Val Val He Glu Phe Leu Ala Tyr 

15 10 is 

Arg Pro Leu Arg His Ser Thr Arg He Ala Val Leu He Thr Ala He 

20 25 30 

Gly Val Ser Phe Leu Leu Glu Tyr Gly Met Val Tyr Leu Val Gly Ala 

35 40 45 

Asn Thr Arg Ala Phe Pro Gin Ala He Gin Thr Val Arg Tyr Asp Leu 

50 55 60 

Gly Pro He Ser Leu Thr Asn Val Gin Leu Met He Leu Ala He Ser 
65 70 75 80 

Leu He Leu Met He Leu Leu Gin Val He Val Gin Lys Thr Lys Met 

85 90 95 

Gly Lys Ala Met Arg Ala Val Ser Val Asp Ser Asp Ala Ala Gin Leu 

100 105 110 

Met Gly He Asn Val Asn Arg Thr He Ser Phe Thr Phe Ala Leu Gly 

115 120 125 

Ser Ala Leu Ala Gly Ala Ala Gly Val Leu He Ala Leu Tyr Tyr Asn 

130 135 140 

Ser Leu Glu Pro Leu Met Gly Val Thr Pro Gly Leu Lys Ser Phe Val 
145 150 155 160 

Ala Ala Val Leu Gly Gly He Gly He He Pro Gly Ala Ala Leu Gly 

165 170 175 

Gly Phe Val He Gly Leu Leu Glu Thr Phe Ala Thr Ala Phe Gly Met 

180 185 190 

Ser Asp Phe Arg Asp Ala lie Val Tyr Gly He Leu Leu Leu He Leu 

195 200 205 

He Val Arg Pro Ala Gly He Leu Gly Lys Asn Val Lys Glu Lys Val 
210 215 220 



{2} INFORMATION FOR SEQ ID NO: 486 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 168 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: Protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 486: 

Met Ser Leu Ala Met Lys Glu Met Gly Gly Asp Val Ser Ala Ala Gin 

5 10 15 

Thr Gly Val He Leu Thr Gly Leu Val Tyr Val Leu Val Ala Thr Ser 

20 

D 30 
He Arg Phe Val Gly Thr Lys Trp n e Asp Lys Leu Leu Pro Pro lie 

35 40 45 

He lie Gly Pro Met He He Val lie Gly Leu Gly Leu Ala Gly Ser 

55 60 
Ala Val Thr Asn Ala Gly Leu Val Ala Asp Gly Asn Trp Lys Asn Ala 

03 7rt 

/o 75 



Leu val Ala Val Val Thr Phe Leu lie Ala Ala Phe tie 



85 



90 



80 

Asn Thr Lys 
95 



Gly Lys Gly P he heu ^ lle n . prQ phe ^ ^ ^ ^ 

100 "5 no 

Gly Tyr Leu Phe Ala Leu Thr Leu Gly Leu Val Asp Phe Thr Pro Val 

115 120 125 

Leu Lys Ala Asn Trp Phe Glu lie Pro Gly Pne ^ Leu prQ phe ^ 



130 



135 



140 



Thr Gly Gly Ala Phe Lys Glu Tyr Asn Leu Tyr Phe Gly Pro Glu Pro 

150 155 



160 



Ser Leu Ser Xaa Gin Ser Xaa Xaa 
165 

(2) INFORMATION FOR SEQ ID NO: 487 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 487 : 
Met Val Lys Val Leu Ala Ala Cys Gly Asn Gly Met Gly Ser Ser Met 



10 



15 
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Val lie Lys Met Lys Val Glu Asn Ala Leu Arg Lys Leu Asn Gin Thr 
20 25 30 

Asp Phe Thr Val Asn Ser Trp Pro Val Ser Val Lys Leu Lys Val 



(2) INFORMATION FOR SEQ ID NO: 488: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 56 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:486: 

Met Ala Ser Val Gly Glu Ala Lys Gly Leu Ala Val Gly Tyr Asp He 

15 10 15 

Val He Ala Ser Leu His Leu He Gin Glu Leu Glu Gly Arg Thr Asn 

20 25 30 

Gly Lys Leu He Gly Leu Asp Asn Leu Met Asp Asp Lys Glu He Thr 

35 40 45 

Glu Lys Leu Ser Gin Ala Leu Gin 
50 55 

(2) INFORMATION FOR SEQ ID NO: 489: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 135 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 489: 

Leu He Ser Leu Gly Leu Glu His Ser Leu Pro Leu Ser Ser His Leu 

15 10 15 

Asn He Ser He Gly Pro Leu He Gin Thr Trp Arg lie Gly Phe Ser 



35 



40 



20 



25 



30 
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85 



90 



Asp Ala Lys Val Ala Gin Pro Gin Lys He Glu Ser Val Leu Pro Leu 

40 45 
He Asn Pro His Gly xi. Glu Leu Asp Ser Ser Thr Ser Thr Val Phe 

55 60 
Leu Lys Gin Lys Gly Met Lys lie Asp Leu Gly cys Leu Ala Lys Gly 

75 80 
!lu Gly Val Thr 

Ser Ala Leu lie Asn Leu Gly Gly Asn He Leu Thr He Gly Lys Asn 

ins 

110 

Cln Ala Arg Gly Asp Asn Pro Trp Gin lie Gly Asp Ser Arg Thr Gin 

115 120 125 

Pro He Leu Gly Glu He He 

130 135 

(2) INFORMATION FOR SEQ ID NO: 490: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 127 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: Protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:490: 

Met lie Asn Lys Tyr Phe Asp Gly Gin He Pro Ala Tyr Val Glu Gly 

5 10 15 

Val Thr Glu Phe Asp His Val Leu Ala Glu Val Ala Glu Lys Ser lie 

20 " 30 

Pro Arg Ala Leu 



25 



Ala Asp Phe His Thr His Met Glu Ala Val Asp Tyr 



35 



40- 



Glu Ala Val Trp Thr Leu He Ser Arg Thr Asn Lys Tyr He Asp Glu 

50 55 60 

Thr Ala Pro Trp Val Leu Asp Lys Asp Glu Ala Leu Arg Asp Gin Leu 

70 7 - 
Ala Ser Val Met Ser His Leu Ala Ala Ser lie ^ Val Val Ala His 



90 



95 



Leu lie Glu Pro Phe Met Met Glu Thr Ser Arg Ala Val Leu Thr Gin 

100 105 110 

Leu Gly Leu Glu Glu Val Ser Ser Leu Glu Asn Leu Lys Phe Gly 



115 



120 



125 
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(2) INFORMATION FOR SEQ ID NO: 491: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 64 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 91: 

Met Leu Val Val lie Arg Phe Leu Lys Lys Leu Ser Leu Trp Leu Ser 

15 10 15 

Lys His Arg Lys Pro Val Pro Ser Met He Gin Arg Arg Leu Ser Val 

20 25 30 

Gly Phe Asn Arg Ala Thr Arg Leu Met Glu Glu Leu Glu Met Ala Gly 

35 40 45 

Val He Gly Pro Ala Glu Gly Thr Lys Pro Arg Lys Val Xaa Gin Gin 
50 55 60 

(2) INFORMATION FOR SEQ ID NO: 492: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 68 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEOUENCE DESCRIPTION: SEQ ID NO: 492: 

Met Val Gly Asn Pro Arg Leu Ala Asp Leu Gly Phe Glu Glu Glu Ala 

1 5 io 15 

Val Gly His His Ala Leu Val Thr Gly Phe Gin Gly Gin Arg Gin Trp 

20 25 30 

Thr Asp His Phe Pro Asn Gly Asp Phe Met Glu Thr Phe Leu Asn Thr 

35 40 45 

Gin Phe Asp Trp Asn Gly He Arg Lys Pro Phe Val Phe Ala Cys He 



50 



55 



60 
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(2) INFORMATION FOR SEQ ID NO: 493 : 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 114 amino acids 
<B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 493: 

Met Gly Thr Leu Trp Lys Leu Ser Ser lie Leu Ser Leu Thr Gly Met 

5 10 15 

Val Phe Glu Asn His Leu Tyr Leu Leu Val Phe Ala Thr Glu Asn Asp 
20 5c 

Ser Leu Asn Gly Val Ser Met Leu Phe Asn Tvr t«, t mv 

, e ne Asn L eu Leu Thr Asn Thr 

45 

Pro Glu Ala Val 

Lys Arg Val Thr Gly His Thr Leu Glu Gly Cys Al'a Ala Ala Gly Phe 

70 " 80 

Leu His Leu Ile Asn ser Gly ser ^ Lfiu ^ g ^ 

85 go 

Ala Thr Arg Asn Gly Lys Pro Val Met Lys Pro Phe Trp Glu Leu Glu 



30 

n Gly Val Ser Met Leu Phe Asn Tyr Leu Leu 

35 

Pro Gin lie Phe Ala Asp Val Arg Thr Tyr Trp Ser 



50 



55 



60 



100 



Glu Ser 



105 



110 



(2) INFORMATION FOR SEQ ID NO: 494 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH :. 146 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 
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Met Lys Leu His Glu Leu Lys Pro Ala Glu Gly Ser Arg Lys Val Arg 

15 10 15 

Asn Arg Val Gly Arg Gly Thr Ser Ser Gly Asn Gly Lys Thr Ser Gly 

20 25 30 

Arg Gly Gin Lys Gly Gin Lys Ala Arg Ser Gly Gly Gly Val Arg Leu 

35 40 45 

Gly Phe Glu Gly Gly Gin Thr Pro Leu Phe Arg Arg Leu Pro Lys Arg 

50 55 60 

Gly Phe Thr Asn He Asn Ala Lys Glu Tyr Ala He Val Asn Leu Asp 
65 70 75 80 

Gin Leu Asn Val Phe Glu Asp Gly Ala Glu Val Thr Pro Val Val Leu 

85 90 95 

He Glu Ala Gly He Val Lys Ala Glu Lys Ser Gly He Lys He Leu 

100 105 HO 

Gly Asn Gly Glu Leu Thr Lys Lys Leu Thr Val Lys Ala Ala Lys Phe 

115 120 125 

Ser Lys Ser Ala Glu Glu Ala He Thr Ala Lys Gly Gly Ser Val Glu 

130 135 140 

Val He 



(2) INFORMATION FOR SEQ ID NO: 4 95: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 63 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 95: 

Met Gly Ala Ala Lys Leu Val Val Phe Ala Asn Ala Val Glu Asp Asn 

15 in is 

Pro Phe Met Ala Gly Ala Phe His Gly Val Gly Glu Ala Asp Val He 

20 25 30 

He Asn Val Gly Val Ser Gly Pro Gly Val Val Lys Arg Ala Leu Glu 

35 40 45 

Lys Val Arg Gly Gin Ser Phe Asp Val Ser Lys Pro Lys Thr Ser 
50 55 60 



145 
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(2) INFORMATION FOR SEQ ID NO: 496 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15S amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 496: 

Met Thr Glu Lys Arg Leu Ala Trp Asp Glu Tyr Phe Ala Ala Gin Ala 

Leu Leu lie Ala Asn Arg Ser Thr Cys Lys Arg Ala Lys Val Gly Ala 

lie Leu val Lys Asp Asn Lys Val lie Ser Thr Gly Tyr Asn Gly Ser 

35 « 45 

Val Ser Gly Thr Glu His Cys lie Asp His Glu Cys Leu Val lie Glu 

50 55 60 

Gly His Cys Val Arg Thr Leu His Ala Glu Val Asn Ala lie Leu Gin 
" 70 75 80 

Gly Ala Glu Arg Gly Val Pro Lys Gly Phe Thr Ala Tyr Val Thr His 

85 9 0 95 

Phe Pro cys Leu Asn Cys Thr Lys Gin Leu Leu Gin Val Gly Cys Lys 

100 105 110 

Arg Val Val Tyr lie Asn Gin Tyr Arg Met Asp Asp Tyr Ala Gin Tyr 

115 120 125 

Leu Tyr Gin Glu Lys Gly Thr Glu Leu Thr His Leu Pro Leu Glu Thr 

130 "5 140 

Val Gin Thr Ala Leu Lys Glu Ala Asp Leu Met 
"5 150 155 

(2) INFORMATION FOR SEQ ID NO: 4 97: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 188 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 497: 



Leu Tip Asp Gin Xaa Gin Leu Asp lie Ser Lys Thr Asn Tyr Leu Arg 

15 10 15 

Val lie Asn Ala Phe Thr Gin lie Glu Ala Ala Lys Ala Tyr Leu Phe 

20 25 30 

Ala Asn Ser Glu Phe Ser Gly Ala Asp Trp Asp Thr Lys lie Ser Arg 

35 40 45 

Asp lie Phe Trp Glu Glu Ser Met His Gly He Tyr Pro Glu Asn Val 

50 55 60 

Gly Val Asn Ala Arg Leu Leu Asn Asp Glu Ala Asp Phe Phe Asp Tyr 
65 70 75 80 

Leu Asn His Ser Ala He Phe Thr Ala Glu Arg Asp Gly Gin Thr Tyr 

85 90 95 

Tyr Phe Tyr Pro He Gin Ala Gly Asp Tyr Leu Ala Thr Pro Glu He 

100 105 no 

Gin Ala Phe Ala Leu Asn Gly Asp Glu Val He He Tyr Pro Gin Glu 

115 120 125 

Lys Asp Phe Glu Thr His Arg Ser Tyr Gin Tyr Gin Asp Leu Thr Thr 

130 135 140 

Arg Gly Thr Val Glu Phe Arg Ser Val Cys Thr Gin Pro Leu Asp Arg 
145 150 155 160 

Thr Phe Ala Ser Ala Ala Phe His Leu Gly Leu Leu Val Asn Leu Asp 

165 170 175 

Lys Leu Glu Ala Tyr Leu Glu Thr Ala Pro Phe Leu 
180 185 



(2) INFORMATION FOR SEQ ID NO: 498 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 124 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 498: 



Met Asp Ser Lys Ala Lys Lys Asp Phe Glu Asn Met Val Ala He Val 

1 5 10 15 

Ala Gin Lys Ala Arg Ser Leu Ser Met Gly Val Cys He Val Met Gin 
20 25 30 
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Gin Pro Arg Ser Asp Ser Leu Ser Thr Asn lie Arg Glu Gin Leu Val 

35 40 45 

Asn Ala lie Phe Leu Gly Ala Pro Thr Arg Glu Ser Ser His Met Met 



55 



60 

Phe Gly Thr Thr Asp Val Pro Gin Val Lys Lys Asp Lys Gly Val Gly 
65 70 



75 80 



Leu Tyr Ser Thr Asp Arg Glu Pro Pro Lys Glu Phe His Ser Pro Met 
85 

Phe Asp Arg Asp Val Phe Glu Val He Leu Pro Val Trp Glu Trp 

n r . 105 "0 

Ala Lys Asp Tyr Met Lys Asp Glu Asp Glu Asp Val 

"5 120 
(2) INFORMATION FOR SEQ ID NO: 499: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 172 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 499 : 

Met Arg His Ser lie Tyr Asp Gly L ys Leu Gly Phe Asp Leu Lys Glu 

5 10 15 

Glu Gly Lys Gin Val Asp Leu Thr Leu Trp Ser Pro Ser Ala Asp Lys 

20 25 30 

Val Ser Val Val Val Tyr Asp Lys Asn Asp Pro Asp Lys Val Val Gly 



40 



Thr Val Ala Leu Glu Lys Gly Glu Arg Gly Thr 



50 



55 



45 

Trp Lys Gin Thr Leu 
60 



Asp Ser Thr Asn Lys Leu Gly He Thr Asp Phe Thr Gly Tyr Tyr Tyr 

70 ? 5 80 

Gin Tyr Gin lie Glu Arg Gin Gly Lys Thr Val Leu Ala Leu Asp Pro 



90 



Tyr Ala Lys Ser Leu Ala Ala Trp Asn Ser Asp Asp Ala 



100 



105 



95 

Lys He Asp 
110 



Asp Ala His Lys Val Ala Lys Ala Ala Phe Val Asp Pro Ala Gin Leu 

120 125 
Gly Pro Gin Asp Pro Phe Leu Arg Gly Leu Met Gly Gin lie Arg Phe 



135 



140 
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Thr lie Ser Arg Leu Val Lys Thr Pro Leu Ser Thr Lys Leu Met Cys 
145 150 155 160 

Val Thr Ser Leu Gin lie Leu Pro Leu Gin Lys Thr 



(2) INFORMATION FOR SEQ ID NO: 500: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 176 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 500: 

Met Cys Met Phe Glu Val Glu Glu Trp Leu His Ser Arg He Gly Leu 

15 10 15 

Asn Phe Arg Ser Gly Leu Ala Arg He Gin Gin Ala Val Asp Leu Leu 

20 25 30 

Gly Asn Pro Glu Gin Ser Tyr Pro He He His Val Thr Gly Thr Asn 

35 40 45 

Gly Lys Gly Ser Thr He Ala Phe Met Arg Glu Leu Phe Met Gly His 

50 55 60 

Gly Lys Lys Val Ala Thr Phe Thr Ser Pro His He Val Ser He Asn 
65 70 75 80 

Asp Arg He Cys He Asn Gly Gin Pro He Ala Asp Ala Asp Phe He 

85 90 95 

Arg Leu Ala Asp Gin Val Lys Glu Met Glu Lys Thr Leu Leu Gin Thr 

100 105 HO 

Pro Asp Gin Leu Ser Phe Phe Glu Leu Leu Thr Leu Val Ala Phe Leu 

115 120 125 

Tyr Phe Arg Glu Gin Glu Val Asp Leu Val Leu Leu Glu Val Gly He 

130 135 140 

Gly Gly Leu Leu Asp Thr Thr Asn Val Val Thr Gly Glu Leu Ala Val 
145 150 155 160 

He Thr Ser He Gly Leu Asp His Gin Lys Thr Leu Gly Asp Ser Ser 



165 



170 



165 



170 



175 



(2) 



INFORMATION FOR SEQ ID 



NO:501: 



(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 182 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:501: 

Met lie Ser Tyr Asp Thr Arg Met Leu Leu Leu lie Ala lie Phe Ser 

15 10 15 

Val Phe Leu Leu Tyr Leu Ser Glu lie Arg Phe Lys Asp Val Ser Phe 



20 



25 



30 



Val Ala Val Phe Ala Thr Val Phe Ala Val Leu Asn Val Leu Met Val 

35 « 45 

Tyr Leu Phe Ser Pro Glu Tyr Gly Val Gly Leu Tyr Gly Glu Arg Ser 



50 



55 



60 



Val lie Trp Gin Gly He Gly Val Tyr Thr Leu Thr Ser Gin Glu Leu 



65 



70 



7 5 80 



Phe Tyr Leu Leu Asn Leu Val lie Lys Tyr Leu Cys Thr lie Pro Leu 

85 90 95 

Ala He lie Phe Leu Met Thr Thr His Pro Ser Gin Phe Ala Ser Ser 

110 

Val Ser Leu 



100 



105 



Leu Asn Gin He Gly Val Pro Tyr Lys lie Ala Tyr Ser 



115 



120 



125 



Thr Leu Arg Tyr He Pro Asp Leu Gin Glu Glu Phe Phe Thr He Lys 



135 



140 



Met ser Gin Glu Ala Arg Gly Met Glu Leu Ser Lys Lys Ala Ser Leu 

5 150 155 leo 

Met Gin Arg He Lys Gly Asn Leu Leu He He Thr Pro Leu He Phe 



"5 170 
Ser Ser Leu Glu Thr His 

180 

(2) INFORMATION FOR SEQ ID NO: 502; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 164 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



175 



(ii) MOLECULE TYPE: Protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 502: 

Met Pro He Ser Glu Arg Thr Val Tyr Gin He Gin Ser Asp Gin Glu 

15 10 15 

Ser Leu Tyr Ala Lys Phe Asn Pro Ala Leu Thr Phe Val Pro Val Asp 

20 25 30 

Phe Glu Asp Leu Ser Ser Gin Met Thr Tyr Asn Lys Cys Val Thr Ala 

35 40 45 

Phe Ala Gin Glu Pro Leu Asp Ala Ala He Gin Lys He Ser Pro Glu 

50 55 60 

Leu Phe Asp Gin Tyr Glu lie Phe Lys Ser Arg Glu Met Leu Leu Glu 
65 70 75 80 

Trp Ser Pro Lys Asn Val His Lys Ala Thr Gly Leu Ala Lys Leu He 

85 90 95 

Ser His Leu Gly He Asn Gin Ser Gin Val Met Ala Cys Gly Asp Glu 

100 105 no 

Ala Asn Asp Leu Ser Met lie Glu Trp Ala Gly Leu Gly Val Ala Met 

115 120 125 

Gin Asn Ala Val Pro Glu Val Lys Ala Ala Ala Asn Val Val Thr Pro 

130 135 140 

Met Thr Asn Asp Glu Glu Ala Val Ala Arg Ala He Glu Gin Tyr Val 
145 150 155 160 

Leu Lys Glu Asn 



(2) INFORMATION FOR SEQ ID NO: 503; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 119 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 503: 
Met Lys He Lys Val Val Thr Val Gly Lys Leu Lys Glu Lys Tyr Leu 



Lys Asp Gly He Ala Glu Tyr Ser Lys Arg He Ser Arg Phe Ala Lys 



1 



5 



10 



15 



20 



25 



30 
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«. <n. n. 01u u. ser *p 01u L „ nr Pro Asp Lys Ala ser 

40 

Glu Ser Glu Asn Gin Lys „. Leu oi« lie 0 1„ Gly «» Ar g X1 . Leu 



50 55 

55 60 



Ser Lys He Ma Asp Ar g Asp Phe Val Jl# ^ ^ ^ ^ ^ ^ 
Lys Thr Phe P he Ser 11 Glu Phe Ser Lys ^ ^ ^ ^ ^ ^ 



85 90 



95 



He Ar g Arg Asp Phe ^ ser ^ phe ^ ^ ^ ^ s ^ 

105 110 

He Val He He Cys Lys Lys 
115 



(2) INFORMATION FOR SEQ ID NO:504: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 88 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:504: 

Leu Glu Gin Ala Pro Gly Leu Ser Leu Ala Asp Thr Leu Ser Leu His 

5 10 15 

TVT Ala Met Ar g Asn Glu Leu Thr Leu Ser Pro Val Asp Phe Leu Leu 

20 5c 

30 

Ar g Ara Thr Asn His Met Leu Phe Met Ar g Asp Ser Leu Asp Ser lie 

40 45 
Val Glu Pro Val Leu Asp Glu Met Gly Ar g Phe Tyr Asp Trp Thr Glu 

55 60 
O u Glu Lys Ala Thr Tyr Ara Ala Asp Val Lys Ala Ala Leu Ala Gin 



60 

u 6 iU Lys Ala Thr Tyr Ar g Ala Asd Val t.„ 

65 

Asn Asp Leu Ala Glu Leu Lys Asn 
85 



75 80 



(2) INFORMATION FOR SEQ ID NO: 505: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 190 amino acids 

(B) TYPE: amino acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 505: 

Met Glu Asp lie Leu Tyr Ala Pro Ala Glu Gin Asn Asn Gly Ser Ala 

15 10 15 

He Glu Lys Ala Phe Gly Lys Asn Gly Lys He Ala Phe Gin Lys Ser 

20 25 30 

Val Asp Lys Tyr Trp Lys Asn Leu He Phe Met Phe Lys Asn Thr Pro 

35 40 45 

Ala Ala Glu Gly Asn Asp Ser Thr Thr Glu Ser Tyr Met Lys Gly Leu 

50 55 60 

Trp Leu Ser Asn His Thr Tyr Gin Trp Gly Gly Leu Met Asp Thr Trp 
65 70 75 80 

Lys Trp Tyr Glu Thr Gly Lys Trp Lys Leu Phe Ala Ser Gly Asn He 

85 90 95 

Gly Lys Ser Gin Gly Asp Arg Gin Trp Leu Thr Glu Pro Glu Ser Met 

100 105 110 

Leu Gly Glu Glu Ala Leu Gly Val Tyr Leu Asn Gly Gly Val Val Tyr 

115 120 125 

Asn Phe Glu His Pro Ala Tyr Thr Tyr Gly Val Asn Asn Lys Glu Ser 

130 135 140 

Leu Leu Phe Ser Glu Val He Lys Glu Phe Phe Arg Tyr Val He Ala 
145 150 155 160 

His Pro Ala Pro Ser Lys Glu Lys Tyr Leu Arg He Gin Lys Tyr Ser 

165 170 175 

Tyr Met Glu Thr He Gin He Lys Glu Met Val Ser Ser Leu 



(2) INFORMATION FOR SEQ ID NO: 506: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 86 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 



180 



185 



190 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 506: 
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? Ttr ne ° iu ■? rar p ~ - - « «. ^ oau Pto 

10 

oiu a. ^ tys ^ Lys ^ val My ue ^ £ ^ 

- - £o va! «. u. „ is " u IU V4l Wa Asp ~ 

40 „ 



«. «. X«U 01y ^ „ ». yal ^ ^ ^ Mu ^ ^ ^ 

60 

Pro H ls val Asp Lys L ys Glu Thr lie Pro Glu His His a , 
65 7n Hls His ^9 I-eu Lys 



Met Leu Glu Phe Gly Asn 
85 



70 ?5 



80 



(2) INFORMATION FOR SEQ ID NO: 507 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 61 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:507: 

Leu Leu Xaa Gl y S er Leu Thr Xaa Thr His Val Ala Met Thr Trp Gin 

Oln Leu Val Val Thr Lys Ser He Glu Lys Leu Asn Val Lys Thr Ar g 

^ Val Gin Pro Asn His Trp Gin H Pro Leu Val Asp H G l u T hr 

40 

Gly Glu He Leu Val Glu Ala Gly Thr lie Met Thr Arg 



50 55 

" 60 

(2) INFORMATION FOR SEQ ID NO: 508: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 74 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 508: 

Met Lys Lys Lys Leu Leu Ala Gly Ala lie Thr Leu Leu Ser Val Ala 

15 10 15 

Thr Leu Ala Ala Cys Ser Lys Gly Ser Glu Gly Ala Asp Leu He Ser 

20 25 30 

Met Lys Gly Asp Val He Thr Glu His Gin Phe Tyr Glu Gin Val Lys 

35 40 45 

Asn Asn Pro Ser Ala Gin Xaa Val Leu Leu Asn Met Thr He Gin Lys 

50 55 60 

Val Phe Glu Lys Thr He Trp Leu Arg Ala 
65 70 

(2) INFORMATION FOR SEQ ID NO: 5 09: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 159 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 509: 

Met Thr Gly Glu He Arg Asn Leu Gin Ala Gin Asp Tyr Asp Ala Ser 

15 10 15 

Gin He Gin Val Leu Glu Gly Leu Glu Ala Xaa Xaa Met Arg Ser Gly 

20 25 30 

Xaa Tyr He Gly Ser Thr Ser Lys Glu Gly Leu His His Leu Val Trp 

35 40- 45 

Glu He Val Asp Asn Ser He Asp Glu Ala Leu Ala Gly Phe Ala Ser 

50 55 60 

His He Gin Val Phe He Glu Pro Asp Asp Ser He Thr Val Val Asp 
65 70 75 80 

Asp Gly Arg Gly He Pro Val Asp He Gin Glu Phe Thr Gly Arg Pro 

85 90 95 

Ala Val Glu Thr Val Phe Thr Val Leu His Ala Gly Gly Lys Phe Gly 

100 105 110 

Gly Gly Gly Tyr Lys Val Ser Gly Gly Leu His Gly Val Gly Ser Ser 



115 



120 



125 
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Val Val Asn Ala Leu Ser Thr Gin Leu Asp Val His ^ His ^ ^ 
Oly Lys lie His Tyr Gln ^ ^ ^ Qly ^ ^ ^ ^ 



145 150 

155 



t2) INFORMATION FOR SEQ ID NO: 510: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 165 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Protein 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 510: 
Met Val Leu Ser Lys Tyr Tyr Gly Val Ala Asp Gl y Met Asn Val Glu 

«Y Ar 9 Giy ser Ala Asn Phe lie Lys Asp Asn Val Leu lie Thr Ser 

25 

Gly Leu Thr Thr Asn Tyr Arg His Asp Tyr Gly Lys Glu Ala Asp Asp 

40 45 
Xle Tyr Val Leu Pro Ala Val Ser Pro Ser Gin Glu Leu Phe Gly Lys 

55 60 
lie Lys Val Lys Glu Val Cys Tyr Leu Lys Glu Phe Ar g Asn Leu Asn 

Ser Lys Asp Ala Arg Glu Tyr Asp Leu Ala Leu Leu lie Leu Glu Glu 

^ 90 
Pro lie Gly Ala Lys Leu Gly Thr Leu Gly Leu Pro Thr Ser Gin Lys 

100 "5 110 

Asn Leu Thr Gl y lie Thr Val Thr lie Thr Gly Tyr Pro Ser Tyr Asn 

120 125 
Phe Lys lie His Gin Met Tyr Thr Asp Lys Lys Gin Val Leu Ser Asp 

ASP Gly „et Phe Leu Asp £ Gin Vai Asp Thr Leu' Glu Gly Ser Ser 

150 155 
Gly Tyr Tyr Ser Leu 160 

165 

(2) INFORMATION FOR SEQ ID NO: 511 : 



(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 55 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 511: 

Met Val Glu Asn Pro lie Leu Gly Thr Ala Glu Glu Gly Ala Leu Phe 

15 10 is 

Ser Leu Thr Ala Glu Gly Lys He Val Val Asn Asn Pro His Lys Ala 

20 25 30 

Asp He Glu Pro Ser Leu Leu Asp His Phe Phe Val Asp Thr His Gin 

35 40 45 

Asp Pro Ser Tyr Gin Ala Cys 
50 55 

(2) INFORMATION FOR SEQ ID NO: 512: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 181 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 512: 

Leu Lys Lys He Ser Iiys Gin Glu Leu Val Asn Thr Arg Phe Ser Arg 

1 5 10 15 

Leu Phe Ala His Phe Gly Gin Glu Lys Asp Gly Ser Phe Leu Ala Gin 

20 25 30 

Arg Tyr Gin Phe Tyr Leu Ala Gin Gin Gly Gin Thr Leu Ser Gly Ala 

35 40 45 

His Asp Leu Leu Asp Ser Leu He Glu Arg Asp Tyr Asn Leu Tyr Ala 

50 55 60 

Ala Thr Asn Gly He Thr Ala He Gin Thr Gly Arg Leu Ala Gin Ser 
65 70 75 80 

Gly Leu Ala Pro Tyr Phe Asn Gin Val Phe He Ser Glu Gin Leu Gin 



85 



90 



95 
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Thr Gin Lys Pro Asp Ala Leu Phe Tyr oi„ Lys He Gly cm Gin He 

105 

Ala Gly Phe Ser Lys Glu Lys Thr Leu Met He Gly Asp Ser Leu Thr 

120 125 
Ala Asp lie Gin Gly Gly Asn Asn Ala Gly He Asp Thr n e Trp ^ 



135 



140 



Asn Pro His His Leu Glu Asn His Thr Gin Ala Gin Pro Thr Tyr Glu 
143 150 



155 



Val Tyr ser Tyr Cln Asp Leu Leu Asp Cys Leu Asp Lys Asn I le Leu 



165 170 
Glu Lys Ile Thr Phe 

180 

(2) INFORMATION FOR SEQ ID NO: 513 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 161 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



160 

175 



(ii) MOLECULE TYPE: Protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:513 : 

Met Lys Tyr Asp Asp Ala Met Ala Leu Tyr Gly Ser Asp Lys Pro Asp 

5 10 15 

Thr Arg P he Asp Met Leu Leu Gin Asp Leu Thr Glu Val Val Lys Gly 

20 25 
Val Asp Phe Lys Val Phe Ser Glu Ala Leu Ala Val Lys n e Val 

40 45 

Val Asn Gly Leu Glu Pro Leu Thr Ala a™ ie « «. ~ 

50 Ala tep Asn Ser Arg Lys Asp 

55 60 

lie Asp Lys „ et Thr Glu Val Ala Lys Gin Tyr Gly Ala Lys Gly Leu 



75 



80 



Ala Trp Val Lys Val Val Asp Gly Glu Leu Asn Gly Pro Val Ala Lys 
85 



90 95 



Phe Leu Thr Gly He Gin Glu Glu Leu Thr Thr Ala Leu Ala Leu Glu 

100 105 110 

Asp Lys Asp Leu Val Leu Phe Val Ala Asp Thr Leu Glu Val Ala Asn 

120 125 
Ala Thr Leu Gly Ala Leu Ar g Gly Ar g rie Ala Lys Glu Leu Gly Leu 



135 



140 
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lie Asp Asn Asp Lys Phe Asn Phe Leu Trp Val Val Asp Trp Pro Met 
145 150 155 160 

Phe 



(2) INFORMATION FOR SEQ ID NO: 514: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 70 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
<D> TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 514: 

Met Val Lys Tyr Gly Val Val Gly Ala Gly Tyr Phe Gly Ala Glu Leu 

15 10 15 

Ala Arg Tyr Met Gin Lys Asn Asp Gly Ala Glu lie Thr Leu Leu Tyr 

20 25 30 

Asp Pro Asp Asn Ala Glu Ala lie Ala Glu Glu Leu Gly Ala Lys Val 

35 40 45 

Ala Ser Ser Leu Asp Glu Leu Val Ser Ser Asp Glu Val Asp Cys Val 

50 55 60 

lie Val Ala Thr Pro Lys 
65 70 

(2) INFORMATION FOR SEQ ID NO: 515: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 97 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
CD) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 515: 

Met Lys He Asp Lys Tyr Ser Ala He Leu Gly Asn Thr Val Gly Phe 
15 10 15 
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His Asn Met Ser Thr Leu Thr Asp His Arg Pro Val Ala Ser Leu Pro 
20 25 
Phe oiy Gly Lys Tyr Arg Leu lie Asp Phe Pro Leu Ser sir Leu Ala 



35 40 



Asn Ala Gly Val Arg Ser Val Phe Gly lie Phe Gin gL 



50 



55 



Asp Asn lie 



Ser ser Val Phe Asp His lie Arg Ser Gly Arg Glu Trp Gly Leu Ser 
Thr Leu Leu Ser His Tyr Tyr L*u Gly He £r Asn Thr Arg Val Glu 



Ser 



85 90 



95 



(2) INFORMATION FOR SEQ ID NO: 516: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 95 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID N0:516: 

5 10 15 

L ys Glu n e Leu Glu ^ val Qlu Lys Asp ^ ^ ^ ^ ne ^ 

20 oc 

30 

Asn Leu lie Gly Leu Leu Ala ^ ^ ^ ^ ^ ^ ^ ^ 

40 45 
Gin lie Val Ser Ser He His Arg Arg Asn Ser Leu Thr Pro Val Glu 

Arg Glu Val Val Cln Xle Thr Ala Ala Val Thr Asn Gly Cys Ala Phe 

«-u Cys Arg Arg Ser Gin His p ro Phe Pro sL Asn Lys Ser Arg 
85 



80 



(2) INFORMATION FOR SEQ ID NO: 517: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 64 amino acids 

(B) TYPE: amino acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 517: 

Met Lys He Gly He He Ala Ala Met Pro Glu Glu Leu Ala Tyr Leu 

15 10 15 

Val Gin His Leu Asp Asn Ala Gin Glu Gin Val Val Leu Gly Asn Thr 

20 25 30 

Tyr His Thr Gly Thr He Ala Ser His Glu Val Val Leu Val Glu Ser 

35 40 45 

Gly He Gly Lys Val Met Ser Ala Met Asn Cys Val Ala Asn Phe Gly 
50 55 60 

(2) INFORMATION FOR SEQ ID NO: 518: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 147 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 518: 

Met Leu His Asn Ala Phe Ala Tyr Val Thr Arg Lys Phe Phe Lys Ser 

15 10 15 

He Val He Phe Leu He He Leu Leu Met Ala Ser Leu Ser Leu Val 

20 25 30 

Gly Leu Ser He. Lys Gly Ala Thr Ala Lys Ala Ser Gin Glu Thr Phe 

35 40 45 

Lys Asn He Thr Asn Ser Phe Ser Met Gin He Asn Arg Arg Val Asn 

50 55 60 

Gin Gly Thr Pro Arg Gly Ala Gly Asn He Lys Gly Glu Asp He Lys 
65 70 75 80 

Lys He Thr Glu Asn Lys Ala He Glu Ser Tyr Val Lys Arg He Asn 

85 90 95 

Ala He Gly Asp Leu Thr Gly Tyr Asp Leu He Glu Thr Pro Glu Thr 



100 



105 



110 
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Lys Lys Asn Leu Thr Ala Asp Arg Ala Lys Arg Phe Gly Ser Ser 

115 120 125 

Thr lie Thr Gly Val Asn Asp Ser Ser Lys Gly Asp ^ ^ ^ ^ 



130 
Gly Pro He 
145 



135 



140 



(2) INFORMATION FOR SEQ ID NO: 519: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 170 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: Protein 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:519 : 
Leu Arg Lys Asn lie Gly Leu Val Leu Gin Asp Leu Ser lie Arg Glu 

Leu Leu ser Pro He Ser Pro Cys Thr Lys Lys Ser Val Met si Arg 

20 25 
Phe Lys Ala Ala Ala Ala Phe Val Asp Ala Asp ser Phe Ue Gin Glu 

35 40 45 

Leu Pro Gin Arg Tyr Asp Ser Pro Val Ser Glu Arg Gly Ser Ser Phe 

50 55 60 

Ser Thr Gly cm Arg Gin Leu Leu Ala Phe Ala Arg Thr Val Ala Ser 

70 75 80 

Gin Pro Lys il e L eu lie Leu Asp Glu Ala Thr Ala Asn He Asp Ser 

85 gQ 
Glu Thr Glu Ser Leu Val Gin Ala Ser Leu Ala Lys Met Arg 11 Gly 

1Q °- "5 no 

Arg Thr Thr Xle Ala lie Ala His Arg Leu Ser Thr He Gin Asp Ala 

Asn Cys lie Tyr Val Leu Asp Lys Gly Arg lie lie Glu Ser Gly Thr 

. " 0 135 140 

His Glu Glu Leu Leu Ala Leu Gly Gly Thr Tyr His Lys Met Tyr Ser 



150 



Leu Gin Ala Gly Ala Met Ala Asp Thr Leu 
"5 170 



155 



160 



(2) INFORMATION FOR SEQ ID NO: 520: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 165 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 520: 

Met Arg Asn Met Lys Ala Lys Tyr Ala Val Trp Val Ala Phe Phe Leu 

15 10 15 

Asn Leu Thr Tyr Ala He Val Glu Phe He Ala Gly Gly Val Phe Gly 

20 25 30 

Ser Ser Ala Val Leu Ala Asp Ser Val His Asp Leu Gly Asp Ala He 

35 40 45 

Ala He Gly He Ser Ala Phe Leu Glu Thr He Ser Asn Arg Glu Glu 

50 55 60 

Asp Asn Gin Tyr Thr Leu Gly Tyr Lys Arg Phe Ser Leu Leu Gly Ala 
65 70 75 80 

Leu Val Thr Ala Val He Leu Val Thr Gly Ser Val Leu Val He Leu 

85 90 95 

Glu Asn Val Thr Lys He Leu His Pro Gin Pro Val Asn Asp Glu Gly 

100 105 110 

He Leu Trp Leu Gly He He Ala He Thr He Asn Leu Leu Ala Ser 

115 120 125 

Leu Val Val Gly Lys Gly Lys Thr Lys Asn Glu Ser He Leu Ser Leu 

130 135 140 

His Phe Leu Glu Asp Thr Leu Gly Trp Val Ala Val He Leu Met Ala 
145 150 155 160 

He Val Pro Ser He 
165 

(2) INFORMATION FOR SEQ ID NO: 521: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 93 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: Protein 
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fxi) SEQUENCE DESCRIPTION: SEQ ID NO:521: 
Met cm Thr Ma ^ ne ser yal ne ^ v ^ ^ ^ v ^ 

Gin Tyr Leu Glu Lys Ser He Ala Ser Gln Lys Gln Thr ^ ^ 

20 jc 

" 30 
Asn Leu Glu He lie Leu Val Asp Asp G l y Ala Thr Asp Glu ser Gly 

Arg Leu Cys Asp Ser lie Ala Glu Gln Asp Asp Arg Val Ser Val Leu 

His Lys Lys Asn Glu Gly Leu Ser Gln Ala Arg Asn Asp Gly Met Lys 



75 



Gln Ala His Gly Asp iy r Leu lie Phe He Asp Ser Lys 



80 



85 90 



(2) INFORMATION FOR SEQ ID NO: 522 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 105 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 522: 

Met lie Lys Asp lie Gln Gly Asp Lys Met Asn Asn Asn Leu Leu Val 
5 10 15 

^0 25 
Gly Val Ala. Leu Glu «» ser Pro Thr ^ Lys ^ ^ ^ ^ ^ 

40 45 
Hx. Asp He Thr Pro Tyr Asn lie Phe Glu Gly Ser Tyr Arg Leu Phe 

55 60 
Gln Thr Val Asp Tyr Trp Pro Glu Gly Thr Thr Phe Val Ser Val Val 

70 75 
Asp Pro Gly Val Gly Ser Lys Arg Lys Ser Val Val Ala Lys Thr IL 



85 



Gln Lys Ser He His Cys His Ala Arg 
100 105 



90 



95 
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(2) INFORMATION FOR SEQ ID NO: 523: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 144 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 523: 



Leu Val Glu Val Gin Lys lie Lys Val lie Arg Thr Lys Lys Gly Glu 

15 10 15 

Asn Met Ala Phe Leu Gin Ala Asp Asp Ser Lys Lys Lys Leu Asp Val 

20 25 30 

Thr Leu Phe Ser Asp Leu Tyr Arg Gin Val Gly Gin Glu lie Lys Glu 

35 40 45 

Gly Ala Phe Tyr Tyr Val Lys Gly Lys lie Gin Ser Arg Asp Gly Arg 

50 55 60 

Leu Gin Met He Ala Gin Glu He Arg Glu Ala Val Ala Glu Arg Phe 
65 70 75 80 

Trp He Gin Val Lys Asn His Glu Ser Asp Gin Glu He Ser Arg He 

85 90 95 

Leu Glu Gin Phe Lys Gly Pro He Pro Val He He Arg Tyr Glu Glu 

100 105 110 

Glu Gin Lys Thr He Val Ser Pro His His Phe Val Ala Lys Ser Asn 

115 120 125 

Glu Leu Glu Glu Lys Leu Asn Glu He Val Met Lys Thr He Tyr Arg 
130 135 140 



(2) INFORMATION FOR SEQ ID NO: 524 : 



<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 524: 
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Met Asn Ser Phe Lys Asn Phe Lys Glu Trp Gly Leu Phe Leu Leu 

He Leu Ser Leu Leu Ma Leu Ser Arg He Phe Phe Trp Ser ^ ^ 

20 25 
Arg Val Glu Gly His Ser Met Asp Pro Thr Leu Ala Asp Gly Glu He 

40 

Le« Phe Val Val Lys His Leu Pro lie Asp Arg Phe As'p He Val Val 
Ala Hxs Glu Glu Asp Gly Asn Lys Asp He Val Lys Arg Val He Gly 



Met Pro Gly Asp Thr He Arg Tyr Glu Lys 



75 



80 



85 90 



(2) INFORMATION FOR SEQ ID NO: 525: 

(if SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 109 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 525 : 

Met Val Leu Gin Arg Asn Glu He Asn Glu Lys Asp Thr Trp Asp Leu 

5 10 15 

Ser Thr He Tyr Pro Thr Asp Gin Ala Trp Glu Glu Ala Leu Lys Asp 

20 5c 

30 

Leu Thr Glu Gin Leu Glu Thr Val Ala Gin Tyr Glu Gly His Leu Leu 



35 



Asp Ser Ala Asp Asn Leu Leu Glu He Thr Glu Phe 



50 



55 



Ser Leu Glu Met 



60 



Glu Arg Gin Met Glu Lys Leu Tyr Val Tvr Ala His Met Lys Asn Asp 

Gin Asp Thr Arg Glu Ala Lys Tyr Gin Glu Zr Tyr Ala Lys Ala Zt 

85 90 
Thr Leu Leu Gin Pro Val Arg Pro Ser Leu Phe n- Leu " 
10 ° 105 

(2) INFORMATION FOR SEQ ID NO: 526: 



(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 45 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 



(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 526: 

Met Met He Glu Lys Glu Asp Leu Thr Pro Phe He His Leu Gin His 

15 10 15 

His Gin Gin Glu Leu Thr Leu Lys Asp Gly He Val He Thr Ala Lys 

20 25 30 

Leu Ala Gin Leu Ala Gly Val Lys Val Gly Gin Thr Leu 



(2) INFORMATION FOR SEQ ID NO: 527: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:527: 

Leu Ala Leu Phe Asp Glu Lys Asp Gin Phe val Gin Thr Val Thr He 

1 5 10 15 

Ala Ser His Arg Lys Gin Lys Asn Phe Asp He He Lys Phe Lys Asp 

20 25 30 

Met Tyr His„Ile.*-Asn-Thr He Glu Lys Tyr Lys Gly Tyr Ser Leu Lys 

35 40 45 

Val Ala Glu Glu Asp Leu Asn Asp Leu Asp Asp Gly Glu Phe Tyr Tyr 

50 55 60 

His Glu He He Gly Leu Glu Val Tyr Glu Gly Asp Ser Leu Val Gly 
65 70 75 80 

Thr He Lys Glu Asn Pro Ala Thr Arg Cys 



35 



40 



45 



85 



90 



(2) INFORMATION FOR SEQ ID 



NO:528: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 142 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:528: 

Met Pro Val Ser Arg Met Trp Cys Ser Thr Ser lie Leu Arg n e Leu 

10 

«. « »P Ser Thr S er M. t „. Ar, Lys Thr Tyr Trp £ a„ 

Leu ne Oln ser Phe Thr Gly Ser 11 Gly phe ^ £ m 

" 40 
He Leu Met Met Leu oiy Ser Leu Ala Leu Val Gly Za Lys Val Thr 

Ser Pro Asn net cl„ Thr Thr Ala Asn Ala Tyr l^eu Thr Thr Al, Cl„ 

* Leu Asp i« Ala « „e C Ser Asn ,yr cly ^ Asp CX» Al. Tyr 

90 

Oln Glu Glu Leu Lys Gin Thr Glu Gly Ala Glu Val Glu Phe Gl'y Tyr 

110 

Leu Thr Asp Val Thr Met Asp Asn Gly Gin Asp Ala Xle Arg Leu Tyr 



120 



125 



Ser Lys Pro Glu Arg He Ser Thr Phe Gin Leu Arg Lys Gly 



135 



140 



(2) INFORMATION FOR SEQ ID NO: 529: 

(i) SEQUENCE CHARACTERISTICS: 

(A) ^LgjcePHs .37 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:529: 

Met Tyr val Ala Thr Glu Arg Glu He Ala Lys His Ile Ser Arg Leu 

5 10 -15 
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Gly Trp Tyx Arg Asn Lys Asp Lys Phe Leu Lys Lys Cys Ala Gin Gin 

20 25 30 

Leu Ala Lys Arg Phe 
35 



(2) INFORMATION FOR SEQ ID NO: 530: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 80 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 530 : 



Met Lys His Gin Gin Lys Phe Val Pro Tyr Lys Thr Leu Gly Ala Asp 

15 10 15 

Ala Val Gly Met Ser Thr Val Pro Glu Val He Val Ala Ala His Ser 

20 25 30 

Gly Leu Lys Val Leu Gly He Ser Cys He Thr Asn Phe Ala Ala Gly 

35 40 45 

Phe Gin Glu Glu Leu Asn His Glu Glu Val Val Glu Val Thr Glu Arg 

50 55 60 

Val Lys Gly Asp Phe Lys Gly Leu Leu Lys Ala He Leu Ala Glu Leu 
65 70 75 80 

(2) INFORMATION FOR SEQ ID NO: 531: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 143 amino acids 

(B) JTXPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 531: 

Leu Leu Glu He Leu Asp Phe Asp Trp Ser He Phe Leu His Asp Val 
15 10 15 
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Glu Lys Thr Glu Lys Phe Val Phe Leu Leu Leu Val Phe Ser Met Ser 



20 



25 



30 



He Thr Cys Leu Leu Ala Leu Phe Trp Arg ciy lie Glu Glu 



35 



40 



Leu Ser 



45 



Leu Arg Lys Met Gin Ala Asn Leu Lys Arg Leu Leu Ala Gly Gin Glu 

50 ss 60 

Val val Gin Val Ala Asp Pro Asp Leu Asp Ala Ser Phe Lys ser Leu 



65 



70 



75 



80 



Ser Gly Lys Leu Asn Leu Leu Thr Glu Ala Leu Gin Lys Ala Glu Asn 



85 



90 



His Ser Leu Ala His Glu Glu Glu lie H e Glu Lys Glu 



100 



105 



95 

Arg Lys Arg 
110 



He Ala Arg Asp Leu His Asp Thr Val Ser Gin Glu Leu Phe Ala Ala 

115 120 125 

His Met lie Leu Ser Gly Phe Ser Gin Gin Ala Leu Lys Phe Gly 



135 



140 



(2) INFORMATION FOR SEQ ID NO: 532: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 67 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 532: 
Met Asn Lys Leu He Ala Phe He Glu Lys Gly Lys 



1 



5 



Pro Phe Phe Glu 



10 



Lys Leu ser Arg Asn He Tyr Leu Arg Ala lie Arg Asp Gly Phe lie 
2<l 25 30 



Ala Gly Met Pro Val He Leu Phe Ser Ser lie Phe He Leu lie Ala 

5 40 45 

Phe val Pro Asn Ser Trp Gly Phe Lys Trp Ser Asp Ala Ser Cys Ser 



50 

Leu Ser Asp 
65 



55 



60 



(2) INFORMATION FOR SEQ ID NO: 533: 



(i) SEQUENCE CHARACTERISTICS : 
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(A) LENGTH: 91 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Protein 

Cxi) SEQUENCE DESCRIPTION: SEQ ID NO:533: 

Met lie Ser Lys Cys Cys Val Arg Trp Ala lie Pro Met Gly Leu Lys 

15 10 15 

lie lie Leu Ala Asn Met Asp Gly Arg Ser Glu Gly Glu Pro Pro Tyr 

20 25 30 

Thr Leu Leu Asp Phe Phe Pro Asp Asp Phe Leu lie Met lie Asp Glu 

35 40 45 

Ser His Met Thr lie Gly Gin lie Lys Gly Met Tyr Asn Gly Asp Arg 

50 55 60 

Ser Arg Lys Glu Met Leu Val Asn Tyr Gly Phe Arg Leu Pro Ser Ala 
65 70 75 80 

Leu Asp Asn Arg Pro Leu Arg Arg Glu Glu Phe 



(2) INFORMATION FOR SEQ ID NO: 534: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 68 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE- DESCRIPTION : SEQ ID NO: 534: 

Met Lys Gin Val Glu Ala Lys Glu Glu Glu Gin Glu Gin Ala Glu Arg 

15 10 15 

Glu Ala Glu Ser Lys Lys Glu Pro Tyr lie Tyr Tyr lie Leu Ser Phe 

20 25 30 

Ala Lys Leu Ala Asp Leu Val Ala Phe Ala Lys Thr Val Thr Phe Glu 

35 40 45 

Met Glu Thr Ser Glu Leu Tyr Lys Met Asn Glu Arg Tyr Tyr Gly Leu 
50 55 60 



85 



90 
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Phe Asp Arg Leu 
65 



(2) INFORMATION FOR SEQ ID NO: 535: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 65 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 535 : 
Met Lys Ala Pro Gin Ala Ala Gly lie lie His Ser Asp Phe Glu Lys 

Gly Phe lie Arg Ala Val Thr Met Ser Tyr Glu Asp Leu Val Lys Tyr 

n c 20 " 30 

Gly Ser Glu Lys Ala Val Lys Lys Ala Gly Arg Leu Arg Glu Glu Gly 



40 



45 



Lys Glu Tyr lie Val Gin Asp Gly Asp lie Met Glu Phe Arg Phe Asn 



50 



55 



Val 
65 



60 



(2) INFORMATION FOR SEQ ID NO: 53 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 66 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) , TOPOLOGY: linear - 

(ii) MOLECULE TYPE: Protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53 6: 
Met lie Lys Thr Thr Ser Leu Val Val Leu lie Gly Val Val Glu Val 
Thr Lys Val Gly Gin Gin lie lie Asp Ser Asn Arg Leu 



20 



25 



464 



Thr lie Pro 
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Thr Gly Ser Phe Trp Val Tyr Gly Thr lie Leu Val Leu Tyr Phe Ala 

35 40 45 

Val Cys Tyr Pro lie Ser Lys Leu Ser Thr His Leu Glu Lys His Trp 

50 55 60 

Ser Asn 
65 

(2) INFORMATION FOR SEQ ID NO: 537: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 81 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 537 : 

Met Glu Val Val Met Asp Asn He He Asp Val Ser He Pro Val Ala 

15 10 15 

Glu Val Val Asp Lys His Pro Glu Val Leu Glu He Leu Val Glu Leu 

20 25 30 

Gly Phe Lys Pro Leu Ala Asn Pro Leu Met Arg Asn Thr Val Gly Arg 

35 40 45 

Lys Val Ser Leu Lys Gin Gly Ser Lys Leu Ala Gly Thr Pro Met Asp 

50 55 60 

Lys He Val Arg Thr Leu Glu Ala Asn Gly Tyr Glu Val He Gly Leu 
65 70 75 80 

Asp 



(21 .INFORMATION FOR SEQ ID NO: 538 : 

(i) SEQUENCE CHARACTERISTICS: 
(A} LENGTH: 61 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 538: 
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Met lie Phe Leu He Arg Met lle ^ ^ ^ ^ ^ ^ ^ ^ 

5 10 IS 

Leu lie Leu Val Ala Phe Ala Val Met Ser Trp Phe Pro Gly Ala Tyr 

20 25 
Glu ser Ser Leu Gly Arg Trp lie Val Ala Leu Val Lys Pro Val Leu 

40 45 
Ala Pro Leu Gin Arg Leu Pro Leu Gin He Ala Gly Pro 
50 55 



(2) INFORMATION FOR SEQ ID NO: 539 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Protein 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 539: 
Met Lys Tyr He Ser Tyr Arg Ser Glu Pro Asn Val Asn Pro Glu Ser 

Thr Thr Glu Thr Phe Thr Ser Gly Ala Phe Phe Val Asp Ser As'p Arg 

30 

Phe Arg Gly Val Arg Ser Phe Leu Phe Pro Tyr Arg 



35 40 



(2) INFORMATION FOR SEQ ID NO: 540: 

(i) SEQUENCE CHARACTERISTICS: 

(A) .LENGTH: -71 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 540: 
Met Thr Lys He Ser He Tyr Glu Leu Val Lys Arg Ala Gly Val Ser 



10 



466 



15 
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Arg Ala Ala Phe Tyr Arg Asn Tyr Asp Ser Lys Glu Glu He Ser Glu 

20 25 30 

Ser Val Phe Lys Arg Thr Val His Asn He Met Glu Gin Met His His 

35 40 45 

Tyr Asp Leu Lys Thr Asp Leu Tyr Val Val Cys Phe Thr Phe Ser Gly 

50 55 60 

Arg Pro Glu Arg Lys Leu Glu 



(2) INFORMATION FOR SEQ ID NO: 541: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 171 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

<ii> MOLECULE TYPE: Protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 541: 

Leu His Gin Glu Pro Ser Leu Pro Ala Thr Val Thr Val Glu Tyr Asp 

1 5 10 15 

Lys Gly Phe Pro Lys Thr His Lys Val Thr Trp Gin Ala He Pro Lys 

20 25 30 

Asp Lys Val Asp Ser Tyr His Thr Phe Glu Val Leu Gly Lys Val Glu 

35 40 45 

Gly He Asp Leu Glu Ala Arg Ala Lys Val Ser Val Asp Gly He Val 

50 55 60 

Ser Val Glu Glu Val Ser Val Thr Thr Pro He Ala Glu Ala Pro Gin 
65 70 75 8Q 

Leu Pro Glu Ser Val Arg Thr Tyr Asp Ser Asn Gly His Val Ser Ser 

85 90 95 

Ala Lys Val Ala Trp Asp Ala He Arg Pro Glu Gin Tyr Ala Lys Glu 

100 105 no 

Gly Val Phe Thr Val Asn Gly Arg Leu Glu Gly Thr Gin Leu Thr Thr 

115 120 125 

Lys Leu His Val Arg Val Ser Ala Gin Thr Glu Gin Gly Ala Asn He 

130 135 140 

Ser Asp Gin Trp Thr Gly Ser Glu Leu Pro Leu Ala Phe Ala Ser Asp 
145 150 155 160 

Ser Asn His Lys Arg Pro Ser Phe Lys Cys Ser 



70 



165 



170 
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(2) INFORMATION FOR SEQ. ID NO: 542: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 130 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 542: 
Met Glu Leu Arg Arg Pro Arg Leu Ala Asp Lys Lys Ala Val Leu Asp 



10 15 



Met Met Thr Glu Phe Glu Lys Ser Gin Ser Ala His Asp Gly Gly Phe 

20 25 30 

Trp Asp Thr Glu Asn Phe Val Tyr Glu Glu Trp Leu Glu Ser Asn Gin 



35 



40 



45 



Glu Gin Glu Met Gly lie Asn Leu Pro Glu Gly Trp Val Pro Ala lie 

50 55 60 

Gin Leu Val Ala Phe Ser Glu Lys Gly Gin Ala Val Gly Phe Leu Asn 

" 70 7 * 80 

Leu Arg Leu Arg Leu Ser Asn Phe Leu Leu Glu Glu Gly Gly His lie 

85 90 95 

Gly Tyr Ser lie Arg Pro Ser Glu Arg Gly Lys Gly Tyr Ala Lys Glu 

100 "5 110 

Leu Ser Val Arg Ala Cys Lys Leu Leu Arg Lys Arg Thr Ser Arg Lys 



Leu Trp 
130 



115 "o 125 



(21 INFORMATION FOR SEQ ID NO: 543 : 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 98 amino acids 
<B) TYPE: amino acid 

(C> STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:543: 
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Leu Leu Leu He Phe Ala Val He Gly Val Leu Val 

15 10 
Gin Phe Tyr Ser Ala Lys Ala Ala Val Gly Phe Ala 

20 25 
Asn Asp Leu Tyr Arg His He Leu Ser Leu Pro Lys 

35 40 
Arg Leu Thr Thr Ser Ser Leu Val Thr Arg Leu Thr 

50 55 60 

Gin He Gin Thr Gly He Asn Gin Phe Leu Leu Ser 
65 70 75 

Ala His He Ser Phe Leu Val Pro Phe Leu Trp Leu 
85 90 

Leu Ser 



Ala Leu He Ala 
15 

Lys Glu Leu Thr 
30 

Asp Ser Arg Asp 
45 

Ser Asp Thr Tyr 

Leu Phe Thr Ser 
80 

He Glu Ser Gin 
95 



(2) INFORMATION FOR SEQ ID NO: 544: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 544: 

Met Ala Tyr Arg He Ser Ala Glu Leu Thr Phe Trp Phe Leu He Met 

15 10 15 

Val Ala He Leu Thr lie Val He Val Arg Val He Ser He Gly Gin 

20 25 30 

Ser Ser Leu ^Gln - 
35 

(2) INFORMATION FOR SEQ ID NO: 545: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
CD) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 545 ; 

Met Thr Thr Phe Lys Asp Arg Phe Leu Trp Gly Gly Ala Val Ala Ala 
5 10 15 



His Gin Leu Glu Gly Gly Trp Gin Glu Gly Gly Lys Gly U 



20 



e Ser Val 



25 



Ala Asp Val Met Thr Ala Gly Arg His 
35 40 



30 



(2) INFORMATION FOR SEQ ID NO: 546 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 68 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54 6: 

Met Leu Asp Met Leu Arg Asp Met Gin Asp Lys Ala Arg Lys Asn Lys 

lie Asp Phe Ala Val Ala Gly Tyr Leu Asn Thr Ser Phe lie gL Lys 

20 25 
Met Asn Gin Leu Gly He Lys Cys lie lie His Tyr Ser Hr lie Pro 



35 



40 



45 



Glu lie Phe Asp Leu Glu lie Asp His Pro Asp His Leu Lys His lie 



50 

Lys Glu Glu Lys 
65 



55 



60 



(2) INFORMATION FOR SEQ ID NO: 547; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 89 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 547: 



Met Lys Asn Val Glu Leu Lys Glu 

1 5 
Leu Pro Gly Leu Lys Ala Lys Arg 
20 

Gly Ala Glu Asn Tyr Val Gin Leu 

35 40 
Leu Ala Leu Glu Met Thr Pro Tyr 

50 55 
Gly Glu Gly Phe Ser His Val Gly 
65 70 
Gin Arg lie Gly Leu Glu Leu His 
85 



Glu Asn Met Thr Phe Glu Glu He 

10 15 
Lys Tyr Val Arg Thr Gly Trp Gly 
25 30 
Phe Asp Thr He Glu Gin Asn Gly 
45 

Phe Leu He Asn Val Ser Gly Glu 
* 60 

Ala Arg Gin Phe Val Met Phe Trp 
75 80 

Asp 



(2) INFORMATION FOR SEQ ID NO: 548: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 98 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 548: 



Leu Asp Ser Gin Thr Val Ser Ala 

1 5 
Glu Gly Thr Asp Leu Ala He Xaa 
20 

Tyr Leu Lys Pro Asp Thr Ser Ser 

35 40 
Lys Glu Met Arg Thr Ala Ala Asp 

50 55 
Gin He Thr Asn Glu Asn Tyr Met 
65 70 
Leu Ser Gin Met Asn Leu Arg Val 
85 

Cys Leu 



Lys Gly Tyr His Phe Pro Leu Ser 

10 15 
Thr Ser Glu Gly Thr Thr Ser Gin 
25 30 
Tyr Phe Ser Lys Ser Ala Tyr Glu 
45 

Lys Tyr Leu Ser Gin Asp Ser He 
60 

Gly Gly His Gly Gly Tyr Leu Arg 

75 80 
Lys Thr He Gin Phe Asn Arg Leu 
90 95 
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(2) INFORMATION FOR SEQ ID NO: 549: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 120 amino acids 
<B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 549 : 

Met Lys Lys Leu Val P he Val Cys Leu Gly Asn He Cys Ar g Thr Pro 

10 15 

Met Ala Glu Phe Val Met Lvs Ser M»t- tv,^ » 

ec j^ys ser Met Thr Asp Asn Tyr Glu lie Gin 

25 

Ser Arg Ala Thr Ser Ser Trp Glu His Gly Asn Pro lie His Lys Gly 

40 

Thr «. 0ly lle ph . G1 „ „. ^ ^ ne ^ ^ ^ ^ ^ 
* s. r „ oln Ile ^ » Glu ^ £ ^ ^ ^ ^ 

n. ay Met „. _ ta val ser ^ p ^ ^ ■ » 

90 

« His cy, Oln A8P Lys Ile ^ ser phe Sm ^ ^ ^ £ ^ 



Glu Pro Trp Tyr Thr Gly Arg Phe 
115 120 



105 



110 



(2) INFORMATION FOR SEQ ID NO: 550 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : -88 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 550: 

Met Lys Thr Thr Phe Ser ^ Pro Lys Trp Ala Glu. lie Pro Asn lie 

5 10 15 
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Asp Leu Tyr Leu Asp Gin Val Leu Leu Tyr Val Asn Gin Val Cys Ala 

20 25 30 

Pro He Ser Pro Asn Lys Asp Lys Gly Leu Thr Ala Ser Met Val Asn 

35 40 45 

Asn Tyr Val Lys Asn Gly Tyr Leu Thr Lys Pro Asp Lys Lys Lys Tyr 

50 55 60 

Gin Arg Gin Gin He Ala Pro Phe Asp Cys Tyr His Asn Pro Gin Val 
65 70 75 

Leu Tyr Phe Pro He Pro Arg Lys 
85 



80 



(2) INFORMATION FOR SEQ ID NO: 551: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 105 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 551: 

Met Pro Thr Ala Phe Tyr Leu Phe Phe Ser Ser Met Tyr Gin Asp Thr 

1 5 10 15 

Pro Gly Gly Pro Ala Asn Phe Met Arg Asp Tyr Leu He Ser Met Thr 

20 25 30 

Ala Phe Ser Met Met Ser Thr Ala lie Phe Ser Phe Pro Val Val Leu 

35 40 45 

His Thr Asp Lys Met Ser Asn Trp Gin Lys Thr Leu Arg His Ser Pro 

50 55 6 o 

Val Asn Met Val Glu Tyr Tyr Leu Ser Lys He Thr Val Cys Trp Val 
65 70 75 80 

Asp Tyr Leu Val Phe His Ser Trp Trp Phe Phe Leu Ser Leu Gly Phe 

85 90 95 

Leu Leu Glu Gly Val Gly Leu Val Leu 
100 105 



(2) INFORMATION FOR SEQ ID NO: 552: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 amino acids 

(B) TYPE: amino acid 

473 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 552: 

Met Ala Ser Pro Tyr Tyr Leu Asp Glu Glu Ala Leu Lys Tyr He Asp 

15 " 15 

Tyr Asp Leu Asp Val Lys He Phe Thr Asp Gly Glu Lys Arg Leu Leu 

20 25 30 

Asp Val Glu Glu Tyr Glu Arg His Lys Arg Lys Met Asn Tyr Ser Asp 

35 40 45 

Asp Leu Asp Tyr He Leu Lys Glu His Val Lys He Leu Val Asp Trp 

50 55 60 

He Asn Asn Gly Arg Gly Pro Phe Ser Glu Ala Tyr Val Asn He Trp 
65 70 75 80 

Tyr Lys Arg Tyr Val Glu Leu Lys Asn Arg 
85 90 
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WHAT IS CLAIMED IS: 



10 



1 . An isolated polynucleotide comprising a polynucleotide sequence selected from 
the group consisting of: 

(a) a polynucleotide having at least a 70% identity to a polynucleotide encoding a 
polypeptide comprising an amino acid sequence of selected from the group consisting of SEQ ID 
NO:263 through 552, 

(b) a polynucleotide which is complementary to the polynucleotide of (a); 

(c) a polynucleotide having at least a 70% identity to a polynucleotide encoding the 
same mature polypeptide expressed by the a gene comprising a polynucleotide sequence selected 
from the group consisting of SEQ ID NO: 1 through 262 contained in the DNA of the deposited 
clone; and 

(d) a polynucleotide comprising at least 1 5 sequential bases of the polynucleotide 
of (a), (b)or(c). 

1 5 2. The polynucleotide of Claim 1 wherein the polynucleotide is DNA. 

3. The polynucleotide of Claim 1 wherein the polynucleotide is RNA. 

4. The polynucleotide of Claim 2 comprising the nucleic acid sequence set forth in 
SEQ ID NO: 1 THROUGH 262. 

5. The polynucleotide of Claim 2 comprising a nucleotide sequence selected from 
20 the group consisting of SEQ ID NO:l through 262. 

6. The polynucleotide of Claim 2 which encodes a polypeptide comprising an 
amino acid sequence selected from the group consisting of SEQ ID NO:263 through 552. 

7. A vector comprising the polynucleotide of Claim 1 . 

8. A host cell comprising the vector of Claim 7. 

25 9 - A process for producing a polypeptide comprising: expressing from the host 

cell of Claim 8 a polypeptide encoded by said DNA. 

10. A process for producing a noval polypeptide or fragment comprising 
culturing a host of claim 8 under conditions sufficient for the production of said polypeptide 
or fragment. 

3° 11. A polypeptide comprising an amino acid sequence which is at least 70% 

identical to an amino acid sequence selected from the group consisting of SEQ ID NO:263 
through 552. 

12. A polypeptide comprising an amino acid selected from the group consisting of 
SEQ ID NO:263 through 552. 
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13. An antibody against the polypeptide of claim 1 1. 

14. ^ anta^nist which 
claim IK ^ 

polypepddecompnsing: ^^H^^M.t^^,^,^ 

polypeptide of claim II. 

antagonist of Claim 14. 

10 17. A process for diagnosing a disease related to expression or activity of the 

polypeptide of claim 1 1 in an individual comprising: 

(a) determining a nucleic acid sequence encoding said polypeptide, and/or 

tne individual. 

"""ravuyofthepolypqxideofcUim „ c^QMne 

c ^«««^°»cor^,hepo,^ 

!™^><i^^„ r e i p ( ^ t01heu ^ 

compound; 

nueiacoon of the compound with the polypepdde 
25 19. A method for inducing „ immunologic* response in . mammal which 

comprises mocmadng the mnmmtU with Strep,^ rf ^ , , „ , 

said animal from disease. 

30 dr . 20 ' ATOth ^° findUCingimmUno,0 ^ rcs ^ 

de hvennganucleicacid vector to dire ct expression of St rep tc^ po.ypeptide of claim 1, 

orfragmentoravariant^reof.forexpressmgsaidS^^ 

or a vanant thereof in W w i„ onier to induce an immunological response to produce 

anubody and/ or T cell immune response to protect said animal from disease. 

22. An isolated nucleic acid encoding one of the amino acid sequences of 
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Claim 21 and nucleic acid sequences capable of hybridizing therewith under stringent 
conditions. 

23. Recombinant vectors comprising the nucleic acid sequences of 
Claim 22 and host cells transformed or transfected therewith. 

24. A method of identifying an antimicrobial compound comprising contacting 
candidate compounds with a polypeptide of Claim 21 and selecting those compounds 
capable of inhibiting the bioactivity of said polypeptide. 

25. Antimicrobial compounds identified by the method of Claim 24. 

26. An isolated polynucleotide comprising a polynucleotide sequence selected from 
die group consisting of: 

(a) a polynucleotide having at least a 70% identity to a polynucleotide encoding a 
polypeptide comprising an amino acid sequence ofSEQ ID NO; 389, 

(b) a polynucleotide which is complementary to the polynucleotide of (a); 

(c) a polynucleotide having at least a 70% identity to a polynucleotide encoding the 
same mature polypeptide expressed by the a gene comprising a polynucleotide sequence of SEQ 
ID NO: 389 contained in the DNA of the deposited clone; and 

(d) a polynucleotide comprising at least 15 sequential bases of the polynucleotide 



of(a),(b)or(c). 



27. A polypeptide encoded by the polynucleotide of claim 26. 
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